Update README.md
Browse files
README.md
CHANGED
@@ -1,5 +1,8 @@
|
|
1 |
t5-small pretrained on
|
2 |
-
|
3 |
-
|
|
|
|
|
|
|
4 |
|
5 |
tokenizer: sentencepiece 8K shared vocabulary
|
|
|
1 |
t5-small pretrained on
|
2 |
+
|
3 |
+
• kbd (latin script) 835K lines: a pile of scraped text from news sites, books etc.
|
4 |
+
|
5 |
+
• ru 3M lines: wiki corpus from OPUS
|
6 |
+
|
7 |
|
8 |
tokenizer: sentencepiece 8K shared vocabulary
|