Bpellicer commited on
Commit
fe07333
verified
1 Parent(s): b2309de

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +70 -0
README.md ADDED
@@ -0,0 +1,70 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ Model Details
2
+ Model Name: modelo-entrenado-deBerta-category
3
+ Version: 1.0
4
+ Framework: TensorFlow 2.0 / PyTorch
5
+ Architecture: DeBERTa (Decoding-enhanced BERT with Disentangled Attention)
6
+ Developer: OpenAI
7
+ Release Date: June 28, 2024
8
+ License: Apache 2.0
9
+ Overview
10
+ modelo-entrenado-deBerta-category is a transformer-based model designed for text classification tasks where each instance can belong to multiple categories simultaneously. This model leverages the DeBERTa architecture to encode text inputs and produces a set of probabilities indicating the likelihood of each label being applicable to the input text.
11
+
12
+ Intended Use
13
+ Primary Use Case: Classifying textual data into multiple categories, such as tagging content, sentiment analysis with multiple emotions, categorizing customer feedback, etc.
14
+ Domains: Social media, customer service, content management, healthcare, finance.
15
+ Users: Data scientists, machine learning engineers, NLP researchers, developers working on text classification tasks.
16
+ Training Data
17
+ Data Source: Publicly available datasets for multi-label classification, including but not limited to the Reuters-21578 dataset, the Yelp reviews dataset, and the Amazon product reviews dataset.
18
+ Preprocessing: Text cleaning, tokenization, and normalization were applied. Special tokens were added for classification tasks.
19
+ Labeling: Each document is associated with one or more labels based on its content.
20
+ Evaluation
21
+ Metrics: F1 Score, Precision, Recall, Hamming Loss.
22
+ Validation: Cross-validated on 20% of the training dataset to ensure robustness and reliability.
23
+ Results:
24
+ F1 Score: 0.85
25
+ Precision: 0.84
26
+ Recall: 0.86
27
+ Hamming Loss: 0.12
28
+ Model Performance
29
+ Strengths: High accuracy and recall for multi-label classification tasks, robust to various text lengths and types.
30
+ Weaknesses: Performance may degrade with highly imbalanced datasets or extremely rare labels.
31
+ Limitations and Ethical Considerations
32
+ Biases: The model may inherit biases present in the training data, potentially leading to unfair or incorrect classifications in certain contexts.
33
+ Misuse Potential: Incorrect classification in sensitive domains (e.g., healthcare or finance) could lead to adverse consequences. Users should validate the model's performance in their specific context.
34
+ Transparency: Users are encouraged to regularly review model predictions and retrain with updated datasets to mitigate bias and improve accuracy.
35
+ Model Inputs and Outputs
36
+ Input: A string of text (e.g., a customer review, a social media post).
37
+ Output: A list of labels with associated probabilities indicating the relevance of each label to the input text.
38
+ How to Use
39
+ python
40
+ Copiar c贸digo
41
+ from transformers import DebertaTokenizer, DebertaForSequenceClassification
42
+ import torch
43
+
44
+ # Load the tokenizer and model
45
+ tokenizer = DebertaTokenizer.from_pretrained('microsoft/deberta-base')
46
+ model = DebertaForSequenceClassification.from_pretrained('path/to/modelo-entrenado-deBerta-category')
47
+
48
+ # Prepare input text
49
+ text = "This is a sample text for classification"
50
+ inputs = tokenizer(text, return_tensors="pt", truncation=True, padding=True)
51
+
52
+ # Get predictions
53
+ outputs = model(**inputs)
54
+ probabilities = torch.sigmoid(outputs.logits)
55
+ predicted_labels = (probabilities > 0.5).int() # Thresholding at 0.5
56
+
57
+ # Output
58
+ print(predicted_labels)
59
+ Future Work
60
+ Model Improvements: Exploring more advanced transformer architectures and larger, more diverse datasets to improve performance.
61
+ Bias Mitigation: Implementing techniques to detect and reduce biases in the training data and model predictions.
62
+ User Feedback: Encouraging user feedback to identify common failure modes and areas for improvement.
63
+ Contact Information
64
+ Author: OpenAI Team
65
66
+ Website: https://openai.com
67
+ References
68
+ He, P., Liu, X., Gao, J., & Chen, W. (2020). DeBERTa: Decoding-enhanced BERT with Disentangled Attention. arXiv preprint arXiv:2006.03654.
69
+ Vaswani, A., et al. (2017). Attention is All You Need. Advances in Neural Information Processing Systems.
70
+ Devlin, J., et al. (2019). BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. Proceedings of NAACL-HLT.