End of training

Files changed (3) hide show

README.md CHANGED Viewed

@@ -18,8 +18,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [meta-llama/Llama-3.2-1B](https://huggingface.co/meta-llama/Llama-3.2-1B) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.3600
-- Accuracy: 0.925
 ## Model description
@@ -39,8 +39,8 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
-- train_batch_size: 16
-- eval_batch_size: 16
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
@@ -48,13 +48,13 @@ The following hyperparameters were used during training:
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss | Accuracy |
-|:-------------:|:-----:|:----:|:---------------:|:--------:|
-| No log        | 1.0   | 24   | 1.0517          | 0.4833   |
-| No log        | 2.0   | 48   | 0.8261          | 0.6333   |
-| No log        | 3.0   | 72   | 0.5273          | 0.8167   |
-| No log        | 4.0   | 96   | 0.3771          | 0.8833   |
-| No log        | 5.0   | 120  | 0.3600          | 0.925    |
 ### Framework versions

 This model is a fine-tuned version of [meta-llama/Llama-3.2-1B](https://huggingface.co/meta-llama/Llama-3.2-1B) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.1445
+- Accuracy: 0.9792
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
+- train_batch_size: 1
+- eval_batch_size: 1
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 ### Training results
+| Training Loss | Epoch | Step  | Validation Loss | Accuracy |
+|:-------------:|:-----:|:-----:|:---------------:|:--------:|
+| 0.2079        | 1.0   | 7004  | 0.0928          | 0.9668   |
+| 0.084         | 2.0   | 14008 | 0.1063          | 0.9628   |
+| 0.0611        | 3.0   | 21012 | 0.1121          | 0.9752   |
+| 0.0717        | 4.0   | 28016 | 0.1514          | 0.9786   |
+| 0.0277        | 5.0   | 35020 | 0.1445          | 0.9792   |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ebf4054b11848cabde875138330a4120634b06e8fa215eb9df0a83633576d3f8
 size 4943307184

 version https://git-lfs.github.com/spec/v1
+oid sha256:8edea161ce0ac12a8bc941364b173005cc4da3a38d1e35f9530301c0484f51c3
 size 4943307184

runs/Sep28_02-47-26_5e6f55c5a0b3/events.out.tfevents.1727491647.5e6f55c5a0b3.1309.0 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:636341c42f782c82ddda3ca1a2560858de9e09b1feb50763cd1b12d09b453cb9
-size 21549

 version https://git-lfs.github.com/spec/v1
+oid sha256:4acea96d5d0aa3cee6cbac8e8093ae56e4058517527ca2c98a58d94483404ff1
+size 22238