Megareyka commited on
Commit
b1a7c74
·
verified ·
1 Parent(s): 44e8d03

End of training

Browse files
README.md ADDED
@@ -0,0 +1,58 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ base_model: t5-small
4
+ tags:
5
+ - generated_from_trainer
6
+ datasets:
7
+ - xsum
8
+ model-index:
9
+ - name: t5-small-finetuned-xsum
10
+ results: []
11
+ ---
12
+
13
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
14
+ should probably proofread and complete it, then remove this comment. -->
15
+
16
+ # t5-small-finetuned-xsum
17
+
18
+ This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on the xsum dataset.
19
+
20
+ ## Model description
21
+
22
+ More information needed
23
+
24
+ ## Intended uses & limitations
25
+
26
+ More information needed
27
+
28
+ ## Training and evaluation data
29
+
30
+ More information needed
31
+
32
+ ## Training procedure
33
+
34
+ ### Training hyperparameters
35
+
36
+ The following hyperparameters were used during training:
37
+ - learning_rate: 2e-05
38
+ - train_batch_size: 16
39
+ - eval_batch_size: 16
40
+ - seed: 42
41
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
42
+ - lr_scheduler_type: linear
43
+ - num_epochs: 1
44
+ - mixed_precision_training: Native AMP
45
+
46
+ ### Training results
47
+
48
+ | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
49
+ |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:-------:|:---------:|:-------:|
50
+ | No log | 1.0 | 128 | 3.0142 | 18.033 | 2.7096 | 14.1251 | 14.3191 | 18.9018 |
51
+
52
+
53
+ ### Framework versions
54
+
55
+ - Transformers 4.38.2
56
+ - Pytorch 2.2.1+cu121
57
+ - Datasets 2.18.0
58
+ - Tokenizers 0.15.2
generation_config.json ADDED
@@ -0,0 +1,6 @@
 
 
 
 
 
 
 
1
+ {
2
+ "decoder_start_token_id": 0,
3
+ "eos_token_id": 1,
4
+ "pad_token_id": 0,
5
+ "transformers_version": "4.38.2"
6
+ }
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:098ec4c867944c16cabce7dd2cafecdc14af7cb402eda45285a75eb3dc19e71f
3
  size 242041896
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2d62bae75ceab7e144cd37402e49c6bafacc9405cee08c348d04cb1aac90aaec
3
  size 242041896
runs/Mar20_18-16-55_41439ec362f9/events.out.tfevents.1710958615.41439ec362f9.8264.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:99a36f8fb486e74a4875e96f4be1e5e98eb7829aca4118143da7f71564e16eb8
3
- size 5776
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ecd1f7c30f2231661e19268e1b82d9fcb83bf62c408f4ba6a21197d866b08287
3
+ size 5987
runs/Mar20_18-16-55_41439ec362f9/events.out.tfevents.1710960343.41439ec362f9.8264.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:aae108dbd37a4a5fcf84f8e3f443c1fe3e904bdfec8cce9380fc48271abd8557
3
+ size 4184
runs/Mar20_18-48-49_41439ec362f9/events.out.tfevents.1710960544.41439ec362f9.16292.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9effa25d5df88b61e8466a4b6f4cd542b19972e47e5dd773478b2372f2396eed
3
+ size 6444
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:34fbba6c2c29b5916e5d6be3ff26fbc64a129e42a91123f91e43344e1eb62c00
3
  size 5048
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:504b212605108b9ea18d72e74c1a6fdc4dd018695c0a1c3ccd7f88fa2c96ac91
3
  size 5048