rahul-bhoyar-1995 commited on
Commit
0a28827
·
verified ·
1 Parent(s): d6117ed

End of training

Browse files
README.md CHANGED
@@ -1,7 +1,7 @@
1
  ---
2
  library_name: transformers
3
  license: mit
4
- base_model: gpt2
5
  tags:
6
  - generated_from_trainer
7
  model-index:
@@ -14,9 +14,9 @@ should probably proofread and complete it, then remove this comment. -->
14
 
15
  # reuters-gpt2-text-gen
16
 
17
- This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on an unknown dataset.
18
  It achieves the following results on the evaluation set:
19
- - Loss: 5.4664
20
 
21
  ## Model description
22
 
@@ -43,15 +43,14 @@ The following hyperparameters were used during training:
43
  - total_train_batch_size: 64
44
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
45
  - lr_scheduler_type: cosine
46
- - num_epochs: 2
47
  - mixed_precision_training: Native AMP
48
 
49
  ### Training results
50
 
51
  | Training Loss | Epoch | Step | Validation Loss |
52
  |:-------------:|:------:|:----:|:---------------:|
53
- | 5.6546 | 0.9940 | 125 | 5.7548 |
54
- | 5.2071 | 1.9881 | 250 | 5.4664 |
55
 
56
 
57
  ### Framework versions
 
1
  ---
2
  library_name: transformers
3
  license: mit
4
+ base_model: rahul-bhoyar-1995/reuters-gpt2-text-gen
5
  tags:
6
  - generated_from_trainer
7
  model-index:
 
14
 
15
  # reuters-gpt2-text-gen
16
 
17
+ This model is a fine-tuned version of [rahul-bhoyar-1995/reuters-gpt2-text-gen](https://huggingface.co/rahul-bhoyar-1995/reuters-gpt2-text-gen) on an unknown dataset.
18
  It achieves the following results on the evaluation set:
19
+ - Loss: 5.3160
20
 
21
  ## Model description
22
 
 
43
  - total_train_batch_size: 64
44
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
45
  - lr_scheduler_type: cosine
46
+ - num_epochs: 1
47
  - mixed_precision_training: Native AMP
48
 
49
  ### Training results
50
 
51
  | Training Loss | Epoch | Step | Validation Loss |
52
  |:-------------:|:------:|:----:|:---------------:|
53
+ | 5.0816 | 0.9940 | 125 | 5.3160 |
 
54
 
55
 
56
  ### Framework versions
runs/Oct17_09-35-53_ip-10-192-11-29/events.out.tfevents.1729157767.ip-10-192-11-29.10438.1 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:4d29c9a38048804706603406a7a2b5ff79ff94236c4c9163855f8b616c3ff91d
3
- size 7763
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:446abda5cc6dacf5df93d17abe22bdd51b085f3de9cf47c2c2eb3cb87d94ea79
3
+ size 8377