End of training

Files changed (5)

README.md CHANGED

@@ -113,7 +113,7 @@ xformers_attention: null
 This model is a fine-tuned version of [unsloth/Qwen2-0.5B](https://huggingface.co/unsloth/Qwen2-0.5B) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.7370
 ## Model description
@@ -152,10 +152,10 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
 | 3.9473        | 0.0007 | 1    | 4.6281          |
-| 3.526         | 0.0036 | 5    | 4.4974          |
-| 3.4724        | 0.0071 | 10   | 3.1816          |
-| 2.9741        | 0.0107 | 15   | 2.8234          |
-| 2.9325        | 0.0142 | 20   | 2.7370          |
 ### Framework versions

 This model is a fine-tuned version of [unsloth/Qwen2-0.5B](https://huggingface.co/unsloth/Qwen2-0.5B) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.7445
 ## Model description
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
 | 3.9473        | 0.0007 | 1    | 4.6281          |
+| 3.526         | 0.0036 | 5    | 4.4996          |
+| 3.4748        | 0.0071 | 10   | 3.1894          |
+| 2.9743        | 0.0107 | 15   | 2.8235          |
+| 2.9389        | 0.0142 | 20   | 2.7445          |
 ### Framework versions
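
The validation-loss columns in the two versions of the table differ only slightly. As a rough illustration, the per-step differences can be computed from the values shown in this hunk. This is a minimal sketch; the names `old_val_loss`, `new_val_loss`, and `loss_deltas` are illustrative and not part of the repository.

```python
from decimal import Decimal

# Validation loss per step, copied from the removed (-) and added (+) table rows.
old_val_loss = {5: "4.4974", 10: "3.1816", 15: "2.8234", 20: "2.7370"}
new_val_loss = {5: "4.4996", 10: "3.1894", 15: "2.8235", 20: "2.7445"}

def loss_deltas(old, new):
    """Return new minus old for every step present in both tables."""
    # Decimal keeps the four-decimal values exact, avoiding float rounding noise.
    return {
        step: Decimal(new[step]) - Decimal(old[step])
        for step in sorted(old)
        if step in new
    }

deltas = loss_deltas(old_val_loss, new_val_loss)
for step, delta in deltas.items():
    print(f"step {step:>2}: {delta:+}")
```

Every difference is positive and below 0.01, so the two runs track each other closely over these 20 steps.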

adapter_config.json CHANGED

@@ -21,12 +21,12 @@
   "revision": null,
   "target_modules": [
     "k_proj",
     "v_proj",
-    "gate_proj",
     "down_proj",
-    "o_proj",
-    "up_proj",
-    "q_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

   "revision": null,
   "target_modules": [
     "k_proj",
+    "o_proj",
     "v_proj",
     "down_proj",
+    "gate_proj",
+    "q_proj",
+    "up_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,
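
The two `target_modules` lists in this hunk appear to contain the same seven entries in a different order. A minimal Python sketch of that check follows, with both lists copied from the hunk; the names `old_modules`, `new_modules`, and `same_module_set` are illustrative and not part of the repository or of any library.

```python
# target_modules as listed on the removed (-) side of the hunk.
old_modules = ["k_proj", "v_proj", "gate_proj", "down_proj", "o_proj", "up_proj", "q_proj"]
# target_modules as listed on the added (+) side of the hunk.
new_modules = ["k_proj", "o_proj", "v_proj", "down_proj", "gate_proj", "q_proj", "up_proj"]

def same_module_set(a, b):
    """True when both lists name exactly the same modules, ignoring order."""
    return len(a) == len(b) and set(a) == set(b)

# The change is order-only if the sets match but the sequences differ.
order_only_change = same_module_set(old_modules, new_modules) and old_modules != new_modules
print(order_only_change)
```

If the check holds, the diff in this file reflects a reordering of the list rather than a change in which projection layers the adapter targets.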

adapter_model.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:cda08fc807617498c2977fa39b5bf797d3baf99450cd57204c44fca888071117
 size 70506570

 version https://git-lfs.github.com/spec/v1
+oid sha256:1d44875c7eacfbc04f21df1fc422d81daea4288bcb87199cc85a646d1b15a6eb
 size 70506570
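
The binary files in this commit are stored as Git LFS pointers: a `version` line, an `oid` line holding a SHA-256 digest, and a `size` line in bytes. Here the `oid` changes while `size` stays the same, so the content changed without changing length. The sketch below shows one way to parse such a pointer and check a downloaded payload against it; `parse_lfs_pointer` and `matches_pointer` are hypothetical helper names, not part of Git LFS or any library.

```python
import hashlib

def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }

def matches_pointer(data, pointer):
    """Check that raw bytes have the size and SHA-256 digest the pointer records."""
    if pointer["algo"] != "sha256":
        raise ValueError(f"unsupported hash algorithm: {pointer['algo']}")
    return (
        len(data) == pointer["size"]
        and hashlib.sha256(data).hexdigest() == pointer["digest"]
    )

# Pointer text copied from the added (+) side of the adapter_model.bin hunk.
pointer_text = """version https://git-lfs.github.com/spec/v1
oid sha256:1d44875c7eacfbc04f21df1fc422d81daea4288bcb87199cc85a646d1b15a6eb
size 70506570
"""
pointer = parse_lfs_pointer(pointer_text)
print(pointer["algo"], pointer["size"])
```

To verify a real download, read the file's bytes (for a file this size, hashing in chunks would be gentler on memory) and pass them to `matches_pointer` along with the parsed pointer.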

adapter_model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:300bba3d6dd656ffbc637166ddc9582343511d1ed18f8daea01b247abefa1028
 size 70430032

 version https://git-lfs.github.com/spec/v1
+oid sha256:89feb6d63022ddd3de482dcf3de5c0db6df486708c4d52fff2fc19ee33c59848
 size 70430032

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:86fc0fae7af6c964d28cbdfdf0241b208b3b6f862c8f1df4ccaad6f6a5f33100
 size 6712

 version https://git-lfs.github.com/spec/v1
+oid sha256:8141418bd3841ca1203dbe43091f9f4c2e7bd8a0fcec4b0acaf1e81043f1ff89
 size 6712