thanhkt
/

fine-tuned-16384-pubmed

Text2Text Generation

Generated from Trainer

Model card Files Files and versions Metrics Training metrics Community

thanhkt commited on 5 days ago

Commit

1676f33

·

verified ·

1 Parent(s): 5f747ad

Update README.md

update readme, tag

Files changed (1) hide show

README.md +9 -4

README.md CHANGED Viewed

@@ -1,7 +1,9 @@
 ---
-base_model: ccdv/lsg-bart-base-16384-pubmed
 tags:
 - generated_from_trainer
 datasets:
 - pubmed-summarization
 metrics:
@@ -9,6 +11,8 @@ metrics:
 model-index:
 - name: fine-tuned-16384-pubmed
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -16,7 +20,7 @@ should probably proofread and complete it, then remove this comment. -->
 # fine-tuned-16384-pubmed
-This model is a fine-tuned version of [ccdv/lsg-bart-base-16384-pubmed](https://huggingface.co/ccdv/lsg-bart-base-16384-pubmed) on the pubmed-summarization dataset.
 It achieves the following results on the evaluation set:
 - Loss: 0.3719
 - Rouge1: 0.4602
@@ -49,7 +53,7 @@ The following hyperparameters were used during training:
 - total_train_batch_size: 16
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- lr_scheduler_warmup_steps: 500
 - num_epochs: 2
 ### Training results
@@ -63,6 +67,7 @@ The following hyperparameters were used during training:
 | 1.7104        | 1.3333 | 250  | 1.0021          | 0.4583 | 0.2231 | 0.2918 | 0.4261    |
 | 0.9336        | 1.6    | 300  | 0.5423          | 0.4586 | 0.2228 | 0.2905 | 0.4259    |
 | 0.4902        | 1.8667 | 350  | 0.3719          | 0.4602 | 0.2253 | 0.2911 | 0.4283    |
 ### Framework versions
@@ -70,4 +75,4 @@ The following hyperparameters were used during training:
 - Transformers 4.43.3
 - Pytorch 2.0.0
 - Datasets 2.15.0
-- Tokenizers 0.19.1

 ---
 tags:
 - generated_from_trainer
+- summarize
+- pubmed
+- med
 datasets:
 - pubmed-summarization
 metrics:
 model-index:
 - name: fine-tuned-16384-pubmed
   results: []
+language:
+- en
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 # fine-tuned-16384-pubmed
+This model is fine-tuned  on the pubmed-summarization dataset.
 It achieves the following results on the evaluation set:
 - Loss: 0.3719
 - Rouge1: 0.4602
 - total_train_batch_size: 16
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- lr_scheduler_warmup_steps: 50
 - num_epochs: 2
 ### Training results
 | 1.7104        | 1.3333 | 250  | 1.0021          | 0.4583 | 0.2231 | 0.2918 | 0.4261    |
 | 0.9336        | 1.6    | 300  | 0.5423          | 0.4586 | 0.2228 | 0.2905 | 0.4259    |
 | 0.4902        | 1.8667 | 350  | 0.3719          | 0.4602 | 0.2253 | 0.2911 | 0.4283    |
+| 0.4032        | 2      | 400  | 0.2967          | 0.4718 | 0.2203 | 0.2871 | 0.4243    |
 ### Framework versions
 - Transformers 4.43.3
 - Pytorch 2.0.0
 - Datasets 2.15.0
+- Tokenizers 0.19.1