Andrija/SRoBERTa-F

Aleksandar committed on Oct 7, 2021

Commit be66bc0 • 1 Parent(s): 6607ffa

Added readme file

Files changed (1)

README.md +34 -0

README.md ADDED

	@@ -0,0 +1,34 @@

+---
+datasets:
+- oscar
+- srwac
+- leipzig
+- cc100
+- hrwac
+language:
+- hr
+- sr
+tags:
+- masked-lm
+widget:
+- text: "Ovo je početak <mask>."
+license: apache-2.0
+---
+# Transformer language model for Croatian and Serbian
+Trained on 43 GB of Croatian and Serbian text for one epoch (9.6 million steps, 3 epochs).
+Datasets: Leipzig Corpus, OSCAR, srWac, hrWac, cc100-hr and cc100-sr.
+Number of validation examples used for perplexity: 1,620,487 sentences
+Perplexity: 6.02
+Start loss: 8.6
+Final loss: 2.0
+Thoughts: The model could be trained further; the training loss had not stagnated.
+| Model | #params | Arch. | Training data |
+|-------|---------|-------|---------------|
+| `Andrija/SRoBERTa-X` | 80M | Fifth | Leipzig Corpus, OSCAR, srWac, hrWac, cc100-hr and cc100-sr (43 GB of text) |
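
The README reports a perplexity (6.02) and a final loss (2.0) without saying how the two relate. A minimal sketch of the usual conversion follows, assuming the loss is a mean cross-entropy in nats, so that perplexity = exp(loss). The README confirms neither that assumption nor whether the final loss was measured on training or validation data. The helper names are hypothetical and not part of the repository.

```python
import math


def perplexity_from_loss(loss_nats: float) -> float:
    """Convert a mean cross-entropy loss (assumed to be in nats) to perplexity."""
    return math.exp(loss_nats)


def loss_from_perplexity(perplexity: float) -> float:
    """Inverse conversion: perplexity back to mean cross-entropy in nats."""
    return math.log(perplexity)


# Values copied from the README; how they were measured is not stated there.
reported_final_loss = 2.0
reported_perplexity = 6.02

# Assumption: both numbers follow perplexity = exp(loss).
implied_perplexity = perplexity_from_loss(reported_final_loss)
implied_loss = loss_from_perplexity(reported_perplexity)

print(f"final loss {reported_final_loss} implies perplexity {implied_perplexity:.2f}")
print(f"perplexity {reported_perplexity} implies loss {implied_loss:.3f}")
```

Under this assumption the two reported values do not map onto each other exactly. That would be consistent with the loss and the perplexity coming from different data splits or from a rounded loss figure, though the README does not say which.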