Create README.md
Browse files
README.md
ADDED
@@ -0,0 +1,24 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
This model was trained with MLSUM (Turkish language) summarization datasets where I fine-tuned google/mt5-small using [SimpleT5](https://github.com/Shivanandroy/simpleT5) library.
|
2 |
+
The first results are not promising may be due to using small check-points. I will work on it for improvements!
|
3 |
+
|
4 |
+
The code piece for training
|
5 |
+
|
6 |
+
```
|
7 |
+
from simplet5 import SimpleT5
|
8 |
+
model = SimpleT5()
|
9 |
+
model.from_pretrained("mt5","google/mt5-small")
|
10 |
+
|
11 |
+
# train
|
12 |
+
model.train(train_df=train2, # pandas dataframe with 2 columns: source_text & target_text
|
13 |
+
eval_df=validation2, # pandas dataframe with 2 columns: source_text & target_text
|
14 |
+
source_max_token_len = 512,
|
15 |
+
target_max_token_len = 128,
|
16 |
+
batch_size = 8,
|
17 |
+
max_epochs = 5,
|
18 |
+
use_gpu = True,
|
19 |
+
outputdir = "mt5_mlsum_turkish",
|
20 |
+
early_stopping_patience_epochs = 0,
|
21 |
+
precision = 32
|
22 |
+
)
|
23 |
+
```
|
24 |
+
|