Update README.md
Browse files
README.md
CHANGED
@@ -37,8 +37,8 @@ This may be due to the base model being trained for emoji classification and the
|
|
37 |
This model is better if emojis are to be also included for sentiment analysis.
|
38 |
No Evaluation is done for data with only text and no emojis.
|
39 |
|
40 |
-
The model was fine-tuned with dataset: mteb/tweet_sentiment_extraction from
|
41 |
-
converted to
|
42 |
|
43 |
The model has a test loss of 0.6 and an f1 score of 0.74 on the unseen data from the dataset.
|
44 |
|
@@ -78,4 +78,7 @@ Text: tu mujhe pasandh heh
|
|
78 |
Negative: 0.01
|
79 |
Neutral: 0.22
|
80 |
Positive: 0.76
|
81 |
-
```
|
|
|
|
|
|
|
|
37 |
This model is better if emojis are to be also included for sentiment analysis.
|
38 |
No Evaluation is done for data with only text and no emojis.
|
39 |
|
40 |
+
The model was fine-tuned with the dataset: mteb/tweet_sentiment_extraction from hugging face
|
41 |
+
converted to Hinglish text.
|
42 |
|
43 |
The model has a test loss of 0.6 and an f1 score of 0.74 on the unseen data from the dataset.
|
44 |
|
|
|
78 |
Negative: 0.01
|
79 |
Neutral: 0.22
|
80 |
Positive: 0.76
|
81 |
+
```
|
82 |
+
Possible Future Direction:
|
83 |
+
|
84 |
+
1. Pre-train the Hinglish model with both Hindi, Hinglish, and English datasets. Current tokens for hinlish have very small sizes i.e. low-priority vocabs are used mostly.
|