Commit cee1440 by Amr-khaled · verified · 1 Parent(s): 1b71fa4

Model save

Files changed (2)
  1. README.md +14 -17
  2. model.safetensors +1 -1
README.md CHANGED
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [riotu-lab/ArabianGPT-03B](https://huggingface.co/riotu-lab/ArabianGPT-03B) on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Loss: 5.9804
+ - Loss: 4.8610
 
  ## Model description
 
@@ -42,27 +42,24 @@ The following hyperparameters were used during training:
  - total_train_batch_size: 32
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: cosine
- - num_epochs: 4
+ - num_epochs: 3
 
  ### Training results
 
  | Training Loss | Epoch | Step | Validation Loss |
  |:-------------:|:-----:|:----:|:---------------:|
- | 6.1892 | 0.26 | 10 | 4.8197 |
- | 4.7341 | 0.51 | 20 | 4.1928 |
- | 4.1782 | 0.77 | 30 | 4.0742 |
- | 4.1255 | 1.03 | 40 | 4.0078 |
- | 3.3704 | 1.28 | 50 | 4.1840 |
- | 3.4364 | 1.54 | 60 | 4.4455 |
- | 3.3383 | 1.79 | 70 | 4.3615 |
- | 3.1346 | 2.05 | 80 | 4.3538 |
- | 2.451 | 2.31 | 90 | 5.5225 |
- | 2.456 | 2.56 | 100 | 5.2477 |
- | 2.3692 | 2.82 | 110 | 5.0649 |
- | 2.2492 | 3.08 | 120 | 5.3660 |
- | 1.8112 | 3.33 | 130 | 5.8835 |
- | 1.8044 | 3.59 | 140 | 5.9958 |
- | 1.8347 | 3.85 | 150 | 5.9804 |
+ | 6.4515 | 0.25 | 10 | 4.5855 |
+ | 4.8912 | 0.49 | 20 | 4.1608 |
+ | 4.3524 | 0.74 | 30 | 4.0509 |
+ | 4.1537 | 0.99 | 40 | 4.0484 |
+ | 3.6716 | 1.23 | 50 | 4.0211 |
+ | 3.4284 | 1.48 | 60 | 4.1357 |
+ | 3.5215 | 1.73 | 70 | 4.2520 |
+ | 3.4336 | 1.98 | 80 | 4.0270 |
+ | 2.8886 | 2.22 | 90 | 4.9232 |
+ | 2.6176 | 2.47 | 100 | 5.0723 |
+ | 2.5867 | 2.72 | 110 | 4.8623 |
+ | 2.6076 | 2.96 | 120 | 4.8610 |
 
 
  ### Framework versions
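
Review note: the updated training log can be scanned for the step where validation loss is lowest and compared with the final row that the README reports. The following is a minimal sketch, assuming the rows are copied by hand from the new side of the table; `ROWS` and `best_row` are illustrative names, not part of the repository.

```python
# Rows copied from the updated README table:
# (training_loss, epoch, step, validation_loss)
ROWS = [
    (6.4515, 0.25, 10, 4.5855),
    (4.8912, 0.49, 20, 4.1608),
    (4.3524, 0.74, 30, 4.0509),
    (4.1537, 0.99, 40, 4.0484),
    (3.6716, 1.23, 50, 4.0211),
    (3.4284, 1.48, 60, 4.1357),
    (3.5215, 1.73, 70, 4.2520),
    (3.4336, 1.98, 80, 4.0270),
    (2.8886, 2.22, 90, 4.9232),
    (2.6176, 2.47, 100, 5.0723),
    (2.5867, 2.72, 110, 4.8623),
    (2.6076, 2.96, 120, 4.8610),
]


def best_row(rows):
    """Return the row with the lowest validation loss (last tuple field)."""
    return min(rows, key=lambda row: row[3])


best = best_row(ROWS)
final = ROWS[-1]
print(f"lowest validation loss: {best[3]} at step {best[2]} (epoch {best[1]})")
print(f"final validation loss:  {final[3]} at step {final[2]} (epoch {final[1]})")
```

On these rows the minimum falls at step 50 (4.0211), while the reported evaluation loss of 4.8610 matches the last row at step 120. Training loss keeps falling as validation loss rises in the third epoch, which is consistent with overfitting, though the log alone does not establish the cause.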
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:2f667fc8b1fbc29695041b63277a9ad563361b60b653406c7a11416e80a3accd
+ oid sha256:2d9d336f6156e222a765f60df8fd6a695b82c852f39cdf9705aef90b23dbfd9b
  size 1475630744
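
Review note: the Git LFS pointer above stores only a SHA-256 digest (`oid`) and a byte count (`size`), so a downloaded weights file can be checked against it. The following is a minimal sketch, assuming a local copy of the file exists; the function name `matches_lfs_pointer` and the path in the usage comment are hypothetical.

```python
import hashlib
import os


def matches_lfs_pointer(path, expected_oid, expected_size, chunk_size=1 << 20):
    """Check a local file against the oid (SHA-256 hex) and size of a Git LFS pointer."""
    # A size mismatch is cheap to detect, so check it before hashing.
    if os.path.getsize(path) != expected_size:
        return False
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Hash in chunks so a file of over a gigabyte is not loaded into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected_oid


# Example call with the values from the new pointer (the path is an assumption):
# matches_lfs_pointer(
#     "model.safetensors",
#     "2d9d336f6156e222a765f60df8fd6a695b82c852f39cdf9705aef90b23dbfd9b",
#     1475630744,
# )
```

The size is unchanged between the two pointers (1475630744 bytes), so only the digest distinguishes the old weights from the new ones.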