Upload 13 files

Files changed (6)

README.md +48 -60
config.json +1 -1
model.safetensors +1 -1
model_head.pkl +1 -1
tokenizer.json +2 -2
tokenizer_config.json +7 -0

README.md CHANGED

@@ -9,12 +9,11 @@ base_model: intfloat/multilingual-e5-small
 metrics:
 - accuracy
 widget:
-- text: 'query: Interessant. Hast du das schon mal ausprobiert?'
-- text: 'query: はい、持っていますよ。すぐにメールで送りますね。'
-- text: 'query: Va bene ci sentiamo dopo Marco buona giornata'
-- text: 'query: Ζητώ συγγνώμη, πρέπει να αποχωρήσω τώρα.'
-- text: 'query: Guten Morgen, Maria! Hast du die Präsentation für das Meeting heute
-    fertig?'
 pipeline_tag: text-classification
 inference: true
 ---
@@ -47,10 +46,10 @@ The model has been trained using an efficient few-shot learning technique that i
 - **Blogpost:** [SetFit: Efficient Few-Shot Learning Without Prompts](https://huggingface.co/blog/setfit)
 ### Model Labels
-| Label | Examples                                                                                                                                                                                            |
-|:------|:----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
-| 0     | <ul><li>'query: สวัสดีค่ะ วันนี้เป็นอย่างไรบ้าง?'</li><li>'query: Jag förstår. Vad tycker du att vi ska göra nu?'</li><li>'query: Hej, wszystko w porządku. Właśnie dostałam nową pracę.'</li></ul> |
-| 1     | <ul><li>'query: Чудесно, доскоро!'</li><li>'query: Mama mă cheamă, trebuie să mă întorc acasă, pa.'</li><li>'query: Perdó, ja he de marxar.'</li></ul>                                              |
 ## Uses
@@ -70,7 +69,7 @@ from setfit import SetFitModel
 # Download from the 🤗 Hub
 model = SetFitModel.from_pretrained("setfit_model_id")
 # Run inference
-preds = model("query: はい、持っていますよ。すぐにメールで送りますね。")
 ```
 <!--
@@ -102,23 +101,23 @@ preds = model("query: はい、持っていますよ。すぐにメールで送
 ### Training Set Metrics
 | Training set | Min | Median | Max |
 |:-------------|:----|:-------|:----|
-| Word count   | 2   | 7.3663 | 21  |
 | Label | Training Sample Count |
 |:------|:----------------------|
-| 0     | 286                   |
-| 1     | 290                   |
 ### Training Hyperparameters
 - batch_size: (16, 2)
 - num_epochs: (1, 16)
-- max_steps: 2000
 - sampling_strategy: undersampling
 - body_learning_rate: (1e-05, 1e-05)
 - head_learning_rate: 0.001
 - loss: CosineSimilarityLoss
 - distance_metric: cosine_distance
-- margin: 0.1
 - end_to_end: False
 - use_amp: False
 - warmup_proportion: 0.1
@@ -128,50 +127,39 @@ preds = model("query: はい、持っていますよ。すぐにメールで送
 - load_best_model_at_end: True
 ### Training Results
-| Epoch  | Step | Training Loss | Validation Loss |
-|:------:|:----:|:-------------:|:---------------:|
-| 0.0002 | 1    | 0.3683        | -               |
-| 0.0125 | 50   | 0.3256        | -               |
-| 0.0250 | 100  | 0.211         | 0.1998          |
-| 0.0375 | 150  | 0.1668        | -               |
-| 0.0500 | 200  | 0.0788        | 0.0571          |
-| 0.0625 | 250  | 0.0644        | -               |
-| 0.0750 | 300  | 0.0232        | 0.0286          |
-| 0.0875 | 350  | 0.0024        | -               |
-| 0.1000 | 400  | 0.0014        | 0.0945          |
-| 0.1125 | 450  | 0.0007        | -               |
-| 0.1250 | 500  | 0.0008        | 0.1036          |
-| 0.1375 | 550  | 0.0005        | -               |
-| 0.1500 | 600  | 0.0005        | 0.098           |
-| 0.1625 | 650  | 0.0003        | -               |
-| 0.1750 | 700  | 0.0005        | 0.1056          |
-| 0.1875 | 750  | 0.0004        | -               |
-| 0.2000 | 800  | 0.0006        | 0.1044          |
-| 0.2124 | 850  | 0.0005        | -               |
-| 0.2249 | 900  | 0.0004        | 0.1072          |
-| 0.2374 | 950  | 0.0003        | -               |
-| 0.2499 | 1000 | 0.0001        | 0.0993          |
-| 0.2624 | 1050 | 0.0003        | -               |
-| 0.2749 | 1100 | 0.0003        | 0.1114          |
-| 0.2874 | 1150 | 0.0002        | -               |
-| 0.2999 | 1200 | 0.0002        | 0.1078          |
-| 0.3124 | 1250 | 0.0001        | -               |
-| 0.3249 | 1300 | 0.0002        | 0.0908          |
-| 0.3374 | 1350 | 0.0002        | -               |
-| 0.3499 | 1400 | 0.0002        | 0.1019          |
-| 0.3624 | 1450 | 0.0001        | -               |
-| 0.3749 | 1500 | 0.0002        | 0.11            |
-| 0.3874 | 1550 | 0.0002        | -               |
-| 0.3999 | 1600 | 0.0001        | 0.1031          |
-| 0.4124 | 1650 | 0.0001        | -               |
-| 0.4249 | 1700 | 0.0001        | 0.0996          |
-| 0.4374 | 1750 | 0.0002        | -               |
-| 0.4499 | 1800 | 0.0001        | 0.0903          |
-| 0.4624 | 1850 | 0.0002        | -               |
-| 0.4749 | 1900 | 0.0001        | 0.0901          |
-| 0.4874 | 1950 | 0.0002        | -               |
-| 0.4999 | 2000 | 0.0001        | 0.0854          |
 ### Framework Versions
 - Python: 3.10.11
 - SetFit: 1.0.3

 metrics:
 - accuracy
 widget:
+- text: 'query: Baiklah, kita cakap lagi nanti, Mark. Selamat hari!'
+- text: 'query: Tôi xin lỗi nhưng tôi phải đi'
+- text: 'query: 次回行くときは、私を連れて行ってください。もっと自然の中で活動したいと思っています。'
+- text: 'query: Entschuldigung, ich muss jetzt gehen.'
+- text: 'query: Buenos días, ¿cómo están ustedes?'
 pipeline_tag: text-classification
 inference: true
 ---
 - **Blogpost:** [SetFit: Efficient Few-Shot Learning Without Prompts](https://huggingface.co/blog/setfit)
 ### Model Labels
+| Label | Examples                                                                                                                                                         |
+|:------|:-----------------------------------------------------------------------------------------------------------------------------------------------------------------|
+| 0     | <ul><li>'query: Értem. Mit csinálunk most?'</li><li>'query: Ola Luca, que tal? Rematache o traballo?'</li><li>'query: Lijepo je. Hvala.'</li></ul>               |
+| 1     | <ul><li>'query: Жөнейін, кейін кездесеміз.'</li><li>'query: Така, ќе се видиме повторно.'</li><li>'query: ठीक है बाद में बात करते हैं मार्क अच्छा दिन'</li></ul> |
 ## Uses
 # Download from the 🤗 Hub
 model = SetFitModel.from_pretrained("setfit_model_id")
 # Run inference
+preds = model("query: Tôi xin lỗi nhưng tôi phải đi")
 ```
 <!--
 ### Training Set Metrics
 | Training set | Min | Median | Max |
 |:-------------|:----|:-------|:----|
+| Word count   | 2   | 7.2168 | 25  |
 | Label | Training Sample Count |
 |:------|:----------------------|
+| 0     | 346                   |
+| 1     | 346                   |
 ### Training Hyperparameters
 - batch_size: (16, 2)
 - num_epochs: (1, 16)
+- max_steps: 1400
 - sampling_strategy: undersampling
 - body_learning_rate: (1e-05, 1e-05)
 - head_learning_rate: 0.001
 - loss: CosineSimilarityLoss
 - distance_metric: cosine_distance
+- margin: 0.05
 - end_to_end: False
 - use_amp: False
 - warmup_proportion: 0.1
 - load_best_model_at_end: True
 ### Training Results
+| Epoch      | Step     | Training Loss | Validation Loss |
+|:----------:|:--------:|:-------------:|:---------------:|
+| 0.0004     | 1        | 0.3607        | -               |
+| 0.0179     | 50       | 0.3254        | -               |
+| 0.0357     | 100      | 0.2303        | 0.2049          |
+| 0.0536     | 150      | 0.106         | -               |
+| 0.0714     | 200      | 0.1294        | 0.0748          |
+| 0.0893     | 250      | 0.087         | -               |
+| 0.1071     | 300      | 0.0732        | 0.0787          |
+| 0.1250     | 350      | 0.0019        | -               |
+| 0.1428     | 400      | 0.0027        | 0.1072          |
+| 0.1607     | 450      | 0.0015        | -               |
+| 0.1785     | 500      | 0.0008        | 0.0999          |
+| 0.1964     | 550      | 0.0016        | -               |
+| 0.2142     | 600      | 0.0004        | 0.1215          |
+| 0.2321     | 650      | 0.0012        | -               |
+| 0.2499     | 700      | 0.0008        | 0.1267          |
+| 0.2678     | 750      | 0.0005        | -               |
+| 0.2856     | 800      | 0.0003        | 0.1216          |
+| 0.3035     | 850      | 0.0003        | -               |
+| 0.3213     | 900      | 0.0004        | 0.1142          |
+| 0.3392     | 950      | 0.0004        | -               |
+| **0.3570** | **1000** | **0.0004**    | **0.0616**      |
+| 0.3749     | 1050     | 0.0002        | -               |
+| 0.3927     | 1100     | 0.0004        | 0.0946          |
+| 0.4106     | 1150     | 0.0002        | -               |
+| 0.4284     | 1200     | 0.0003        | 0.1091          |
+| 0.4463     | 1250     | 0.0002        | -               |
+| 0.4641     | 1300     | 0.0003        | 0.1141          |
+| 0.4820     | 1350     | 0.0004        | -               |
+| 0.4998     | 1400     | 0.0002        | 0.1209          |
+* The bold row denotes the saved checkpoint.
 ### Framework Versions
 - Python: 3.10.11
 - SetFit: 1.0.3
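
The added Training Results table bolds step 1000 as the saved checkpoint, and the hyperparameters include `load_best_model_at_end: True`. The diff does not state which metric selects the checkpoint. As a minimal sketch, assuming the selection is by lowest validation loss, the choice can be reproduced from the added rows; the names `validation_loss` and `best_step` are illustrative only.

```python
# (step -> validation loss) copied from the added rows of the Training Results table.
# Rows with "-" in the Validation Loss column are omitted.
validation_loss = {
    100: 0.2049, 200: 0.0748, 300: 0.0787, 400: 0.1072, 500: 0.0999,
    600: 0.1215, 700: 0.1267, 800: 0.1216, 900: 0.1142, 1000: 0.0616,
    1100: 0.0946, 1200: 0.1091, 1300: 0.1141, 1400: 0.1209,
}

# Assumption: the "best" checkpoint is the one with the lowest validation loss.
best_step = min(validation_loss, key=validation_loss.get)
print(best_step, validation_loss[best_step])
```

Under that assumption the result agrees with the bold row and with the `checkpoints/step_1000` value that config.json now records in `_name_or_path`.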

config.json CHANGED

@@ -1,5 +1,5 @@
 {
-  "_name_or_path": "intfloat/multilingual-e5-small",
   "architectures": [
     "BertModel"
   ],

 {
+  "_name_or_path": "checkpoints/step_1000",
   "architectures": [
     "BertModel"
   ],

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ecfce4dd8b2e3179e859bc278ca2390319e04a66f3179fbbeb1bf7b598a86307
 size 470637416

 version https://git-lfs.github.com/spec/v1
+oid sha256:27c89f801f10bb9afe5e4f308a41a0d7492b8725340318de1847eec8f6b84cf1
 size 470637416
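
The model.safetensors entries above are Git LFS pointer files, so the diff shows only the object hash and size, not the weights. A rough sketch of reading such a pointer follows; `parse_lfs_pointer` is a hypothetical helper written for this illustration, not part of any Git LFS tooling.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Split each 'key value' line of a Git LFS pointer into a dict entry."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    fields["size"] = int(fields["size"])  # size is given in bytes
    return fields


# Pointer contents copied from the removed and added sides of the diff.
old = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:ecfce4dd8b2e3179e859bc278ca2390319e04a66f3179fbbeb1bf7b598a86307\n"
    "size 470637416\n"
)
new = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:27c89f801f10bb9afe5e4f308a41a0d7492b8725340318de1847eec8f6b84cf1\n"
    "size 470637416\n"
)

print("same size:", old["size"] == new["size"])
print("different content hash:", old["oid"] != new["oid"])
```

An unchanged size with a different `oid` is consistent with retrained weights for the same architecture, though the pointer alone cannot confirm that.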

model_head.pkl CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:492fb3b7da876887807a7f0eb94fda6a77e65bbb7f72311fb8caaf601a46407c
 size 4608

 version https://git-lfs.github.com/spec/v1
+oid sha256:0b054fef0d715653a0dba9374d17ce2d5fa1a3fb6560f2768740890da80a0321
 size 4608

tokenizer.json CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:45b6ee00bc5023ac454b82c372ebe14b27866fa471b6dbb0d24e09b12909a1f4
-size 17083075

 version https://git-lfs.github.com/spec/v1
+oid sha256:55ce1a4600af70b33f5a7fba12dbb41a504d3c08737c9b26b5e7fd6e437a9a23
+size 17083087

tokenizer_config.json CHANGED

@@ -46,10 +46,17 @@
   "cls_token": "<s>",
   "eos_token": "</s>",
   "mask_token": "<mask>",
   "model_max_length": 512,
   "pad_token": "<pad>",
   "sep_token": "</s>",
   "sp_model_kwargs": {},
   "tokenizer_class": "XLMRobertaTokenizer",
   "unk_token": "<unk>"
 }

   "cls_token": "<s>",
   "eos_token": "</s>",
   "mask_token": "<mask>",
+  "max_length": 512,
   "model_max_length": 512,
+  "pad_to_multiple_of": null,
   "pad_token": "<pad>",
+  "pad_token_type_id": 0,
+  "padding_side": "right",
   "sep_token": "</s>",
   "sp_model_kwargs": {},
+  "stride": 0,
   "tokenizer_class": "XLMRobertaTokenizer",
+  "truncation_side": "right",
+  "truncation_strategy": "longest_first",
   "unk_token": "<unk>"
 }
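
To make the tokenizer_config.json change easier to audit, the sketch below compares the two sides of the hunk as plain dictionaries. It covers only the keys visible in this hunk (the file's earlier lines are not shown), and the variable names are illustrative.

```python
# Keys visible on the removed side of the hunk.
old_fragment = {
    "cls_token": "<s>",
    "eos_token": "</s>",
    "mask_token": "<mask>",
    "model_max_length": 512,
    "pad_token": "<pad>",
    "sep_token": "</s>",
    "sp_model_kwargs": {},
    "tokenizer_class": "XLMRobertaTokenizer",
    "unk_token": "<unk>",
}

# The added side keeps every key above and introduces the "+" lines.
new_fragment = {
    **old_fragment,
    "max_length": 512,
    "pad_to_multiple_of": None,  # JSON null
    "pad_token_type_id": 0,
    "padding_side": "right",
    "stride": 0,
    "truncation_side": "right",
    "truncation_strategy": "longest_first",
}

added = sorted(set(new_fragment) - set(old_fragment))
changed = [k for k in old_fragment if old_fragment[k] != new_fragment[k]]
print(len(added), "keys added:", added)
print("existing keys changed:", changed)
```

The seven added keys line up with the `+7 -0` summary for this file, and no existing value in the hunk was altered.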