itzune
/

argostranslate_en_eu_1_0

Text2Text Generation

Model card Files Files and versions Community

urtzai commited on Nov 8, 2024

Commit

66df461

·

verified ·

1 Parent(s): 7e0b91c

Update README.md

Files changed (1) hide show

README.md +33 -1

README.md CHANGED Viewed

@@ -6,6 +6,38 @@ language:
 tags:
 - text2text-generation
 - open-nmt
 ---
-# EN -> EU argos model

 tags:
 - text2text-generation
 - open-nmt
+- pytorch
 ---
+# Itzune EN -> EU machine translation argos model
+This model was trained using [argostrain](https://github.com/argosopentech/argos-train) training scripts with ~12M English to Basque parallel string datasets obtained directly from the [Opus project](https://opus.nlpl.eu/).
+## Model description
+- **Developed by:** Basque community
+- **Model type:** traslation
+- **Model version:** v0.1
+- **Source Language:** English
+- **Target Language:** Basque
+- **License:** MIT
+<!-- ## Training Data
+The English-Basque parallel sentences were collected from the following datasets:
+| Dataset       	   | Sentences before cleaning |
+|----------------------|--------------------------:|
+| CCMatrix v1          | 7,788,871  	           |
+| EhuHac v1            | 585,210	               |
+| bible-uedin v1       | 15,893                    |
+| GNOME v1             | 652,298                   |
+| HPLT v1.1            | 610,694                   |
+| KDE4 v2              | 100,160                   |
+| OpenSubtitles	v2018  | 805,780                   |
+| Tatoeba v2023-04-12  | 2,070                     |
+| WikiMatrix v1	       | 119,480                   |
+| wikimedia v20230407  | 60,990                    |
+| XLEnt v1.2           | 800,631                   |
+| **Total**     	   | **15,653,108**            | -->