savasy committed on
Commit
914cb13
·
1 Parent(s): 144a725

Create README.md

Files changed (1)
  1. README.md +24 -0
README.md ADDED
@@ -0,0 +1,24 @@
+ This model was fine-tuned from google/mt5-small on the Turkish portion of the MLSUM summarization dataset, using the [SimpleT5](https://github.com/Shivanandroy/simpleT5) library.
+ The first results are not promising, possibly because a small checkpoint was used. I will keep working on improvements!
+
+ The training code:
+
+ ```
+ from simplet5 import SimpleT5
+ model = SimpleT5()
+ model.from_pretrained("mt5", "google/mt5-small")
+
+ # train (train2 and validation2 must already be defined as pandas DataFrames)
+ model.train(train_df=train2,  # pandas dataframe with 2 columns: source_text & target_text
+             eval_df=validation2,  # pandas dataframe with 2 columns: source_text & target_text
+             source_max_token_len=512,
+             target_max_token_len=128,
+             batch_size=8,
+             max_epochs=5,
+             use_gpu=True,
+             outputdir="mt5_mlsum_turkish",
+             early_stopping_patience_epochs=0,
+             precision=32
+             )
+ ```
+
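The training call above expects `train2` and `validation2` to be pandas DataFrames with `source_text` and `target_text` columns, but the README does not show how they were built. The following is only a minimal sketch of one way to prepare them, not the author's actual preprocessing. It assumes each MLSUM record is a dict with `text` and `summary` fields; the helper names `to_simplet5_rows` and `to_dataframe`, and the choice to strip whitespace and skip records with an empty article or summary, are hypothetical.

```python
def to_simplet5_rows(records, text_key="text", summary_key="summary"):
    """Map raw records to the two-column layout that SimpleT5 expects.

    Assumption: each record is a dict holding the article under `text_key`
    and its reference summary under `summary_key`.
    """
    rows = []
    for record in records:
        source = (record.get(text_key) or "").strip()
        target = (record.get(summary_key) or "").strip()
        # Hypothetical cleaning step: drop pairs with an empty side.
        if not source or not target:
            continue
        rows.append({"source_text": source, "target_text": target})
    return rows


def to_dataframe(rows):
    """Wrap the rows in a pandas DataFrame (pandas is imported lazily)."""
    import pandas as pd

    return pd.DataFrame(rows, columns=["source_text", "target_text"])


# Tiny illustrative sample, not real MLSUM data.
sample = [
    {"text": " Uzun bir haber metni. ", "summary": "Kısa özet."},
    {"text": "", "summary": "Metni olmayan kayıt."},
]
rows = to_simplet5_rows(sample)
```

With pandas installed, `train2 = to_dataframe(to_simplet5_rows(train_records))` would give a frame in the shape the `train_df` argument expects, where `train_records` stands in for however the MLSUM training split was loaded.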