End of training

Files changed (4) hide show

README.md CHANGED Viewed

@@ -17,9 +17,9 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google/flan-t5-small](https://huggingface.co/google/flan-t5-small) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.0247
-- F1: 97.2892
-- Gen Len: 7.6006
 ## Model description
@@ -44,15 +44,19 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 10
 ### Training results
 ### Framework versions
 - Transformers 4.44.0
 - Pytorch 2.4.0
-- Datasets 3.0.1
 - Tokenizers 0.19.1

 This model is a fine-tuned version of [google/flan-t5-small](https://huggingface.co/google/flan-t5-small) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.0423
+- F1: 97.5301
+- Gen Len: 2.6047
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 2
 ### Training results
+| Training Loss | Epoch | Step | Validation Loss | F1      | Gen Len |
+|:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|
+| 0.0893        | 1.0   | 1385 | 0.0434          | 96.3938 | 2.6218  |
+| 0.0324        | 2.0   | 2770 | 0.0423          | 97.5301 | 2.6047  |
 ### Framework versions
 - Transformers 4.44.0
 - Pytorch 2.4.0
+- Datasets 3.1.0
 - Tokenizers 0.19.1

logs/events.out.tfevents.1734432976.c5d1fc1ff2cc.23.1 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:2fd3f1f1c4944dd361ce414961500ed1ce24dd70d488666708a6098c321de720
+size 456

tokenizer.json CHANGED Viewed

@@ -2,13 +2,13 @@
   "version": "1.0",
   "truncation": {
     "direction": "Right",
-    "max_length": 8,
     "strategy": "LongestFirst",
     "stride": 0
   },
   "padding": {
     "strategy": {
-      "Fixed": 8
     },
     "direction": "Right",
     "pad_to_multiple_of": null,

   "version": "1.0",
   "truncation": {
     "direction": "Right",
+    "max_length": 3,
     "strategy": "LongestFirst",
     "stride": 0
   },
   "padding": {
     "strategy": {
+      "Fixed": 3
     },
     "direction": "Right",
     "pad_to_multiple_of": null,

tokenizer_config.json CHANGED Viewed

@@ -927,7 +927,7 @@
     "<extra_id_98>",
     "<extra_id_99>"
   ],
-  "clean_up_tokenization_spaces": false,
   "eos_token": "</s>",
   "extra_ids": 100,
   "model_max_length": 512,

     "<extra_id_98>",
     "<extra_id_99>"
   ],
+  "clean_up_tokenization_spaces": true,
   "eos_token": "</s>",
   "extra_ids": 100,
   "model_max_length": 512,