jefson08 commited on
Commit
1936255
·
verified ·
1 Parent(s): 04eb0cb

Model save

Browse files
README.md CHANGED
@@ -14,7 +14,7 @@ should probably proofread and complete it, then remove this comment. -->
14
 
15
  This model was trained from scratch on an unknown dataset.
16
  It achieves the following results on the evaluation set:
17
- - Loss: 8.2659
18
 
19
  ## Model description
20
 
@@ -37,8 +37,8 @@ The following hyperparameters were used during training:
37
  - train_batch_size: 16
38
  - eval_batch_size: 2
39
  - seed: 42
40
- - gradient_accumulation_steps: 32
41
- - total_train_batch_size: 512
42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
43
  - lr_scheduler_type: linear
44
  - num_epochs: 1
@@ -47,7 +47,7 @@ The following hyperparameters were used during training:
47
 
48
  | Training Loss | Epoch | Step | Validation Loss |
49
  |:-------------:|:------:|:----:|:---------------:|
50
- | 5.2959 | 0.6824 | 1000 | 8.2659 |
51
 
52
 
53
  ### Framework versions
 
14
 
15
  This model was trained from scratch on an unknown dataset.
16
  It achieves the following results on the evaluation set:
17
+ - Loss: 11.3743
18
 
19
  ## Model description
20
 
 
37
  - train_batch_size: 16
38
  - eval_batch_size: 2
39
  - seed: 42
40
+ - gradient_accumulation_steps: 64
41
+ - total_train_batch_size: 1024
42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
43
  - lr_scheduler_type: linear
44
  - num_epochs: 1
 
47
 
48
  | Training Loss | Epoch | Step | Validation Loss |
49
  |:-------------:|:------:|:----:|:---------------:|
50
+ | 9.9277 | 0.7051 | 600 | 11.3743 |
51
 
52
 
53
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:1fea087410d40ce1ad75918de426b3db5d4f6ec0b717a75ea386b6a425a345df
3
  size 2247492800
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4ae69b7b117873a4a85c2d23b01e8b58573d9a1c1367bd771f44d2b175399ca6
3
  size 2247492800
runs/Sep11_21-40-37_ip-10-192-10-80/events.out.tfevents.1726090837.ip-10-192-10-80.3114.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:5827bffeb30afa90f859db76a47859e5ea0c46b8e914dcbf90e17dafc20a72ae
3
- size 41921
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:cb4d35cc73ccc0d84fc5d8aae1ee06700e09eda47c53c25bbb1701d07d63b1c2
3
+ size 42275