datnth1709 committed on
Commit e6190ef · 1 Parent(s): 0c306c1

update model card README.md

Files changed (1)
  1. README.md +11 -8
README.md CHANGED
@@ -16,8 +16,8 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [](https://huggingface.co/) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 5.1754
-- Bleu: 0.6140
+- Loss: 0.4378
+- Bleu: 81.8571
 
 ## Model description
 
@@ -42,15 +42,18 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- lr_scheduler_warmup_ratio: 0.5
-- num_epochs: 2
+- lr_scheduler_warmup_ratio: 0.2
+- num_epochs: 5
 
 ### Training results
 
-| Training Loss | Epoch | Step | Validation Loss | Bleu |
-|:-------------:|:-----:|:----:|:---------------:|:------:|
-| No log | 1.0 | 157 | 6.2996 | 0.1306 |
-| No log | 2.0 | 314 | 5.1754 | 0.6140 |
+| Training Loss | Epoch | Step | Validation Loss | Bleu |
+|:-------------:|:-----:|:------:|:---------------:|:-------:|
+| 0.6529 | 1.0 | 31250 | 0.5660 | 78.7315 |
+| 0.5125 | 2.0 | 62500 | 0.4770 | 81.3979 |
+| 0.4798 | 3.0 | 93750 | 0.4554 | 81.6720 |
+| 0.4568 | 4.0 | 125000 | 0.4435 | 81.7753 |
+| 0.4387 | 5.0 | 156250 | 0.4378 | 81.8571 |
 
 
 ### Framework versions