thanhkt commited on
Commit
1676f33
·
verified ·
1 Parent(s): 5f747ad

Update README.md

Browse files

update readme, tag

Files changed (1) hide show
  1. README.md +9 -4
README.md CHANGED
@@ -1,7 +1,9 @@
1
  ---
2
- base_model: ccdv/lsg-bart-base-16384-pubmed
3
  tags:
4
  - generated_from_trainer
 
 
 
5
  datasets:
6
  - pubmed-summarization
7
  metrics:
@@ -9,6 +11,8 @@ metrics:
9
  model-index:
10
  - name: fine-tuned-16384-pubmed
11
  results: []
 
 
12
  ---
13
 
14
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -16,7 +20,7 @@ should probably proofread and complete it, then remove this comment. -->
16
 
17
  # fine-tuned-16384-pubmed
18
 
19
- This model is a fine-tuned version of [ccdv/lsg-bart-base-16384-pubmed](https://huggingface.co/ccdv/lsg-bart-base-16384-pubmed) on the pubmed-summarization dataset.
20
  It achieves the following results on the evaluation set:
21
  - Loss: 0.3719
22
  - Rouge1: 0.4602
@@ -49,7 +53,7 @@ The following hyperparameters were used during training:
49
  - total_train_batch_size: 16
50
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
51
  - lr_scheduler_type: linear
52
- - lr_scheduler_warmup_steps: 500
53
  - num_epochs: 2
54
 
55
  ### Training results
@@ -63,6 +67,7 @@ The following hyperparameters were used during training:
63
  | 1.7104 | 1.3333 | 250 | 1.0021 | 0.4583 | 0.2231 | 0.2918 | 0.4261 |
64
  | 0.9336 | 1.6 | 300 | 0.5423 | 0.4586 | 0.2228 | 0.2905 | 0.4259 |
65
  | 0.4902 | 1.8667 | 350 | 0.3719 | 0.4602 | 0.2253 | 0.2911 | 0.4283 |
 
66
 
67
 
68
  ### Framework versions
@@ -70,4 +75,4 @@ The following hyperparameters were used during training:
70
  - Transformers 4.43.3
71
  - Pytorch 2.0.0
72
  - Datasets 2.15.0
73
- - Tokenizers 0.19.1
 
1
  ---
 
2
  tags:
3
  - generated_from_trainer
4
+ - summarize
5
+ - pubmed
6
+ - med
7
  datasets:
8
  - pubmed-summarization
9
  metrics:
 
11
  model-index:
12
  - name: fine-tuned-16384-pubmed
13
  results: []
14
+ language:
15
+ - en
16
  ---
17
 
18
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
20
 
21
  # fine-tuned-16384-pubmed
22
 
23
+ This model is fine-tuned on the pubmed-summarization dataset.
24
  It achieves the following results on the evaluation set:
25
  - Loss: 0.3719
26
  - Rouge1: 0.4602
 
53
  - total_train_batch_size: 16
54
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
55
  - lr_scheduler_type: linear
56
+ - lr_scheduler_warmup_steps: 50
57
  - num_epochs: 2
58
 
59
  ### Training results
 
67
  | 1.7104 | 1.3333 | 250 | 1.0021 | 0.4583 | 0.2231 | 0.2918 | 0.4261 |
68
  | 0.9336 | 1.6 | 300 | 0.5423 | 0.4586 | 0.2228 | 0.2905 | 0.4259 |
69
  | 0.4902 | 1.8667 | 350 | 0.3719 | 0.4602 | 0.2253 | 0.2911 | 0.4283 |
70
+ | 0.4032 | 2 | 400 | 0.2967 | 0.4718 | 0.2203 | 0.2871 | 0.4243 |
71
 
72
 
73
  ### Framework versions
 
75
  - Transformers 4.43.3
76
  - Pytorch 2.0.0
77
  - Datasets 2.15.0
78
+ - Tokenizers 0.19.1