Commit 3b74bd3 (verified), committed by ParitKansal
1 Parent(s): 94f131c

Training complete

README.md CHANGED
@@ -19,11 +19,11 @@ should probably proofread and complete it, then remove this comment. -->

  This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Loss: 1.9696
- - Rouge1: 49.0677
- - Rouge2: 26.1575
- - Rougel: 45.3308
- - Rougelsum: 45.3274
+ - Loss: 1.7287
+ - Rouge1: 52.033
+ - Rouge2: 28.5069
+ - Rougel: 47.9951
+ - Rougelsum: 47.994

  ## Model description

@@ -48,13 +48,17 @@ The following hyperparameters were used during training:
  - seed: 42
  - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
  - lr_scheduler_type: linear
- - num_epochs: 1
+ - num_epochs: 6

  ### Training results

  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum |
  |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|
- | 2.6768 | 1.0 | 1384 | 1.9696 | 49.0677 | 26.1575 | 45.3308 | 45.3274 |
+ | 3.9223 | 1.0 | 1384 | 2.0230 | 48.3053 | 25.5 | 44.5689 | 44.5717 |
+ | 2.4615 | 2.0 | 2768 | 1.8415 | 50.6518 | 27.4135 | 46.7611 | 46.7466 |
+ | 2.2896 | 3.0 | 4152 | 1.7868 | 51.4143 | 27.9301 | 47.4151 | 47.4095 |
+ | 2.1912 | 5.0 | 6920 | 1.7372 | 51.912 | 28.3549 | 47.8763 | 47.8849 |
+ | 2.1537 | 6.0 | 8304 | 1.7287 | 52.033 | 28.5069 | 47.9951 | 47.994 |


  ### Framework versions
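
The rows added to the training results table can be sanity-checked with a short script. This is a sketch and not part of the commit: it copies the five rows from the "+" side of the README.md diff, checks whether the step counts imply a constant number of steps per epoch, and picks the row with the lowest validation loss. The names `ROWS`, `steps_per_epoch`, and `best_row` are illustrative only.

```python
# Rows copied from the "+" side of the README.md diff:
# (training_loss, epoch, step, validation_loss, rouge1, rouge2, rougeL, rougeLsum)
ROWS = [
    (3.9223, 1.0, 1384, 2.0230, 48.3053, 25.5, 44.5689, 44.5717),
    (2.4615, 2.0, 2768, 1.8415, 50.6518, 27.4135, 46.7611, 46.7466),
    (2.2896, 3.0, 4152, 1.7868, 51.4143, 27.9301, 47.4151, 47.4095),
    (2.1912, 5.0, 6920, 1.7372, 51.912, 28.3549, 47.8763, 47.8849),
    (2.1537, 6.0, 8304, 1.7287, 52.033, 28.5069, 47.9951, 47.994),
]


def steps_per_epoch(rows):
    """Return the set of step/epoch ratios; one value means a constant epoch length."""
    return {round(step / epoch) for _, epoch, step, *_ in rows}


def best_row(rows):
    """Return the row with the lowest validation loss (column index 3)."""
    return min(rows, key=lambda r: r[3])


if __name__ == "__main__":
    print("steps per epoch:", steps_per_epoch(ROWS))
    best = best_row(ROWS)
    print(f"best epoch: {best[1]}, validation loss: {best[3]}, Rouge1: {best[4]}")
```

On these rows every step count is a multiple of 1384, and the lowest validation loss is the epoch 6.0 row, whose values match the headline metrics in the updated summary list.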
runs/Jan16_11-48-58_2d47da243ae0/events.out.tfevents.1737028141.2d47da243ae0.3028.6 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:d0cc53d8d8baec66fdbc712e78be71647ebef355e3592aa6556e27bbe730cd4d
- size 6301
+ oid sha256:7035752f0ea5e433147fe27ffbdfa338801e7392f7cc4d2f40e8c1e9387bd38f
+ size 7129
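
The event file is tracked with Git LFS, so the diff only shows its pointer file (a `version` line, an `oid` line, and a `size` line) rather than the TensorBoard log itself. As a rough illustration, the sketch below parses the old and new pointer text from this diff and compares them. The helper name `parse_lfs_pointer` and the constants `OLD` and `NEW` are hypothetical, and the parser handles only the three keys visible here.

```python
def parse_lfs_pointer(text):
    """Parse Git LFS pointer text into a dict (only the keys seen in this diff)."""
    fields = {}
    for line in text.strip().splitlines():
        # Each pointer line is "<key> <value>", split on the first space.
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid value has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),  # size of the real file in bytes
    }


# Pointer contents copied from the "-" and "+" sides of the diff.
OLD = """version https://git-lfs.github.com/spec/v1
oid sha256:d0cc53d8d8baec66fdbc712e78be71647ebef355e3592aa6556e27bbe730cd4d
size 6301
"""
NEW = """version https://git-lfs.github.com/spec/v1
oid sha256:7035752f0ea5e433147fe27ffbdfa338801e7392f7cc4d2f40e8c1e9387bd38f
size 7129
"""

if __name__ == "__main__":
    old, new = parse_lfs_pointer(OLD), parse_lfs_pointer(NEW)
    print("content changed:", old["digest"] != new["digest"])
    print("size delta in bytes:", new["size"] - old["size"])
```

The changed digest and the larger size are consistent with more events having been appended to the same log file during the longer run, although the pointer alone cannot confirm what was written.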