End of training

Files changed (3)

README.md CHANGED

@@ -1,7 +1,7 @@
 ---
 library_name: transformers
-license: mit
-base_model: gpt2
 tags:
 - generated_from_trainer
 datasets:
@@ -16,9 +16,9 @@ should probably proofread and complete it, then remove this comment. -->
 # distilgpt2-finetuned-recipe-nlg-generator
-This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on the recipe_nlg dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.1789
 ## Model description
@@ -38,19 +38,27 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
-- train_batch_size: 16
-- eval_batch_size: 16
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 2
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 0.3823        | 1.0   | 250  | 2.1386          |
-| 0.1593        | 2.0   | 500  | 2.1789          |
 ### Framework versions

 ---
 library_name: transformers
+license: apache-2.0
+base_model: distilgpt2
 tags:
 - generated_from_trainer
 datasets:
 # distilgpt2-finetuned-recipe-nlg-generator
+This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on the recipe_nlg dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.0487
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
+- train_batch_size: 32
+- eval_batch_size: 32
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 10
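
As a rough illustration of what these hyperparameters imply, the sketch below computes the learning rate a linear schedule would produce at a given step, plus an upper bound on the number of examples seen per epoch. The step counts come from the training results table; the zero warmup, single device, and absence of gradient accumulation are assumptions, since the card does not state them. The function name `linear_lr` is hypothetical.

```python
# Values copied from the model card unless marked as an assumption.
BASE_LR = 2e-05
TRAIN_BATCH_SIZE = 32
NUM_EPOCHS = 10
STEPS_PER_EPOCH = 250  # the results table logs step 250 at epoch 1.0
WARMUP_STEPS = 0       # assumption: the card lists no warmup setting

TOTAL_STEPS = STEPS_PER_EPOCH * NUM_EPOCHS


def linear_lr(step: int,
              base_lr: float = BASE_LR,
              total_steps: int = TOTAL_STEPS,
              warmup_steps: int = WARMUP_STEPS) -> float:
    """Learning rate under linear warmup followed by linear decay to zero."""
    if warmup_steps > 0 and step < warmup_steps:
        return base_lr * step / warmup_steps
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


# Upper bound on examples per epoch, assuming one device and no
# gradient accumulation (the last batch may be smaller).
max_examples_per_epoch = STEPS_PER_EPOCH * TRAIN_BATCH_SIZE

if __name__ == "__main__":
    for step in (0, 1250, 2500):
        print(step, linear_lr(step))
    print("max examples per epoch:", max_examples_per_epoch)
```

If the run used warmup or several devices, the schedule and the example count would differ from this sketch.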
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 0.5203        | 1.0   | 250  | 1.7127          |
+| 0.1633        | 2.0   | 500  | 1.8375          |
+| 0.1527        | 3.0   | 750  | 1.9014          |
+| 0.1471        | 4.0   | 1000 | 1.9561          |
+| 0.1432        | 5.0   | 1250 | 1.9629          |
+| 0.141         | 6.0   | 1500 | 1.9881          |
+| 0.1389        | 7.0   | 1750 | 2.0218          |
+| 0.1377        | 8.0   | 2000 | 2.0492          |
+| 0.1362        | 9.0   | 2250 | 2.0558          |
+| 0.1359        | 10.0  | 2500 | 2.0487          |
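
In the table above, validation loss is lowest after epoch 1 (1.7127) and rises to 2.0487 by epoch 10 while training loss keeps falling, a pattern that may indicate overfitting. The sketch below picks the row with the lowest validation loss and converts losses to perplexity. It assumes the reported loss is a mean per-token cross-entropy in nats, which the card does not state explicitly; the names `rows`, `perplexity`, and `best_row` are hypothetical.

```python
import math

# (training_loss, epoch, step, validation_loss), copied from the table above.
rows = [
    (0.5203, 1.0, 250, 1.7127),
    (0.1633, 2.0, 500, 1.8375),
    (0.1527, 3.0, 750, 1.9014),
    (0.1471, 4.0, 1000, 1.9561),
    (0.1432, 5.0, 1250, 1.9629),
    (0.141, 6.0, 1500, 1.9881),
    (0.1389, 7.0, 1750, 2.0218),
    (0.1377, 8.0, 2000, 2.0492),
    (0.1362, 9.0, 2250, 2.0558),
    (0.1359, 10.0, 2500, 2.0487),
]


def perplexity(loss: float) -> float:
    """Perplexity for a mean cross-entropy loss measured in nats (assumption)."""
    return math.exp(loss)


# Row with the lowest validation loss.
best_row = min(rows, key=lambda row: row[3])

if __name__ == "__main__":
    _, epoch, step, val_loss = best_row
    print(f"best epoch: {epoch}, step: {step}, validation loss: {val_loss}")
    print(f"validation perplexity at best epoch: {perplexity(val_loss):.2f}")
    print(f"validation perplexity at final epoch: {perplexity(rows[-1][3]):.2f}")
```

Selecting the checkpoint by validation loss rather than taking the final one is a common choice when the two curves diverge like this.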
 ### Framework versions

runs/Dec26_15-18-47_4f87fa13cf4f/events.out.tfevents.1735226328.4f87fa13cf4f.52529.0 CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:434aea9f19456dfbb07ec296aad138d6c1b324b2c5255fcde726bda729afe38b
-size 9881

 version https://git-lfs.github.com/spec/v1
+oid sha256:4a15be7d907a44894f53e8448acb6b58eab912298b833dbe16655f0d1cf0f77c
+size 10506

runs/Dec26_15-18-47_4f87fa13cf4f/events.out.tfevents.1735228361.4f87fa13cf4f.52529.1 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:ddf825f74d139338fe9d31da53b43bb43a7528cf0f63a9edba6e1843c666bbb4
+size 359
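
The event files above are stored as Git LFS pointers: small text files holding a spec version, an object ID, and the size in bytes of the real file. A minimal sketch of reading one follows, using the pointer added in this commit as input. The function name `parse_lfs_pointer` is hypothetical, and the parser handles only the three keys shown here.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer file into its version, hash, and size fields."""
    fields = {}
    for line in text.strip().splitlines():
        # Each line has the form "<key> <value>".
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid is written as "<hash algorithm>:<hex digest>".
    hash_algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": hash_algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Pointer contents copied from the added events file in this commit.
pointer_text = """\
version https://git-lfs.github.com/spec/v1
oid sha256:ddf825f74d139338fe9d31da53b43bb43a7528cf0f63a9edba6e1843c666bbb4
size 359
"""

pointer = parse_lfs_pointer(pointer_text)

if __name__ == "__main__":
    print(pointer)
```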