End of training

Files changed (3)

README.md +82 -0
adapter_model.safetensors +1 -1
runs/Jul30_21-18-14_oem-WS-C621E-SAGE-Series/events.out.tfevents.1722341897.oem-WS-C621E-SAGE-Series.2700248.0 +2 -2

README.md ADDED

@@ -0,0 +1,82 @@

+---
+license: apache-2.0
+library_name: peft
+tags:
+- generated_from_trainer
+base_model: slplab/polyglot-ko-1.3b_pretrained-asd
+model-index:
+- name: pretrained-asd_wo-cot_w-asd
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# pretrained-asd_wo-cot_w-asd
+This model is a fine-tuned version of [slplab/polyglot-ko-1.3b_pretrained-asd](https://huggingface.co/slplab/polyglot-ko-1.3b_pretrained-asd) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 10.9440
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 3e-05
+- train_batch_size: 2
+- eval_batch_size: 2
+- seed: 42
+- gradient_accumulation_steps: 4
+- total_train_batch_size: 8
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: linear
+- num_epochs: 3
+- mixed_precision_training: Native AMP
+### Training results
+| Training Loss | Epoch  | Step | Validation Loss |
+|:-------------:|:------:|:----:|:---------------:|
+| 13.2673       | 0.1290 | 1    | 13.2282         |
+| 13.2184       | 0.2581 | 2    | 13.1124         |
+| 13.0976       | 0.3871 | 3    | 12.9704         |
+| 12.9545       | 0.5161 | 4    | 12.8240         |
+| 12.8232       | 0.6452 | 5    | 12.6843         |
+| 12.6522       | 0.7742 | 6    | 12.5389         |
+| 12.5444       | 0.9032 | 7    | 12.3854         |
+| 12.3496       | 1.0323 | 8    | 12.2316         |
+| 12.2126       | 1.1613 | 9    | 12.0854         |
+| 12.0724       | 1.2903 | 10   | 11.9376         |
+| 11.9188       | 1.4194 | 11   | 11.7927         |
+| 11.7615       | 1.5484 | 12   | 11.6542         |
+| 11.6456       | 1.6774 | 13   | 11.5224         |
+| 11.5276       | 1.8065 | 14   | 11.4017         |
+| 11.3736       | 1.9355 | 15   | 11.2929         |
+| 11.2810       | 2.0645 | 16   | 11.1996         |
+| 11.1500       | 2.1935 | 17   | 11.1171         |
+| 11.0963       | 2.3226 | 18   | 11.0492         |
+| 11.0646       | 2.4516 | 19   | 10.9964         |
+| 10.9801       | 2.5806 | 20   | 10.9614         |
+| 10.9325       | 2.7097 | 21   | 10.9440         |
+### Framework versions
+- PEFT 0.11.1
+- Transformers 4.41.1
+- Pytorch 2.3.0+cu121
+- Datasets 2.19.1
+- Tokenizers 0.19.1
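
The hyperparameters and the results table in the card can be cross-checked with a little arithmetic. The sketch below is not part of the commit. It assumes a single training device (the card does not state the device count) and assumes the `Epoch` column counts micro-batches seen divided by micro-batches per epoch. Under those assumptions, the logged values are consistent with 31 micro-batches per epoch, which would explain why a 3-epoch run stops at step 21 with epoch 2.7097.

```python
# Values copied from the model card above.
train_batch_size = 2
gradient_accumulation_steps = 4
num_epochs = 3
epoch_after_first_step = 0.1290  # first row of the results table

# Assumption: one device. The card does not say how many were used.
num_devices = 1

# Effective batch size per optimizer step (the card lists 8).
total_train_batch_size = (
    train_batch_size * gradient_accumulation_steps * num_devices
)

# Assumption: one optimizer step consumes `gradient_accumulation_steps`
# micro-batches, so epoch = micro_batches_seen / micro_batches_per_epoch.
micro_batches_per_epoch = round(
    gradient_accumulation_steps / epoch_after_first_step
)

# Leftover micro-batches that do not fill a full accumulation window
# are assumed not to produce an extra optimizer step.
optimizer_steps_per_epoch = micro_batches_per_epoch // gradient_accumulation_steps
total_optimizer_steps = optimizer_steps_per_epoch * num_epochs

# Fractional epoch reached at the last optimizer step.
final_epoch = (
    total_optimizer_steps * gradient_accumulation_steps / micro_batches_per_epoch
)

print(total_train_batch_size, micro_batches_per_epoch,
      total_optimizer_steps, round(final_epoch, 4))
```

If these assumptions hold, the training set would contain roughly 61 or 62 examples (31 micro-batches of size 2), which is small enough that the reported losses should be read with caution.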

adapter_model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:09f2c753414b699f0d5879c27fdfbd62499e956ab4e7c9f2d4e2b15840efebb6
+oid sha256:13db6779b8f43fcb9921a3af9c33d7827dca685aa64f1fc84fd45d61756d7cab
 size 25192400
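
The diff above changes only a Git LFS pointer: `oid` is the SHA-256 digest of the real file and `size` is its length in bytes. As a minimal sketch, a downloaded `adapter_model.safetensors` could be checked against the new pointer as follows. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and not part of any library.

```python
import hashlib
from pathlib import Path

# New pointer contents, copied from the diff above.
NEW_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:13db6779b8f43fcb9921a3af9c33d7827dca685aa64f1fc84fd45d61756d7cab
size 25192400
"""


def parse_lfs_pointer(text: str) -> dict:
    """Parse the key/value lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer: dict) -> bool:
    """Return True if the file at `path` has the pointer's size and digest."""
    data_path = Path(path)
    if data_path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with data_path.open("rb") as handle:
        # Hash in 1 MiB chunks so large weight files are not loaded at once.
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]
```

Calling `matches_pointer("adapter_model.safetensors", parse_lfs_pointer(NEW_POINTER))` on a local copy would then confirm that the file is the one this commit points to. Since the size is unchanged at 25192400 bytes, only the digest distinguishes the old and new adapter weights.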

runs/Jul30_21-18-14_oem-WS-C621E-SAGE-Series/events.out.tfevents.1722341897.oem-WS-C621E-SAGE-Series.2700248.0 CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:8c5c304142d2b6c3c540e0d53aeb7d60540a6be91a08883f766f9908beb6f01b
-size 14583
+oid sha256:cb37dace8be5623105fb3f65b7cdf24b4a40843a3e4e651d085bf2f14da6e8fe
+size 15404