ccibeekeoc42
commited on
Initial Readme
Browse files
README.md
ADDED
@@ -0,0 +1,58 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
license: apache-2.0
|
3 |
+
datasets:
|
4 |
+
- ccibeekeoc42/TinyStories_yoruba
|
5 |
+
- ccibeekeoc42/TinyStories_igbo
|
6 |
+
- ccibeekeoc42/DollyHHRLHF_igbo
|
7 |
+
- ccibeekeoc42/DollyHHRLHF_yoruba
|
8 |
+
language:
|
9 |
+
- ig
|
10 |
+
- yo
|
11 |
+
- en
|
12 |
+
metrics:
|
13 |
+
- accuracy
|
14 |
+
library_name: adapter-transformers
|
15 |
+
pipeline_tag: text-generation
|
16 |
+
---
|
17 |
+
|
18 |
+
## Model Name: Multilingual LLM for English, Igbo, and Yoruba Translation
|
19 |
+
|
20 |
+
This model specializes in understanding and generating text across English, Igbo, and Yoruba, trained on curated datasets including "tiny_stories" and "dolly_hhrlhf." Utilizing advanced techniques to ensure efficiency and accuracy, the model supports a wide range of applications from translation to content creation.
|
21 |
+
|
22 |
+
## Key Features:
|
23 |
+
- **Trilingual Support:** Seamlessly processes and generates content in English, Igbo, and Yoruba, promoting linguistic diversity and accessibility.
|
24 |
+
- **Custom Training Approach:** Employs a tailored training setup, leveraging specific prompts and responses to enhance model performance on relevant tasks.
|
25 |
+
- **High Sequence Capability:** Handles extensive text inputs (up to 3453 tokens), making it suitable for detailed narratives and complex translation tasks.
|
26 |
+
- **Efficiency Optimizations:** Incorporates strategies such as dataset packing, reducing computational demands while maintaining high-quality output.
|
27 |
+
|
28 |
+
## Applications:
|
29 |
+
- **Translation Services:** Offers precise, context-aware translations, bridging communication gaps between English, Igbo, and Yoruba speakers.
|
30 |
+
- **Content Generation:** Generates culturally and linguistically nuanced content, catering to a diverse audience.
|
31 |
+
- **Educational Tools:** Assists in language learning and preservation, providing resources in underrepresented languages.
|
32 |
+
|
33 |
+
## Future Directions:
|
34 |
+
- Further refinement with diverse text sources to enhance understanding and generation capabilities.
|
35 |
+
- Expansion to additional languages, supporting broader multilingual communication.
|
36 |
+
|
37 |
+
This model represents a step towards more inclusive language technologies, recognizing the importance of language diversity in global communication.
|
38 |
+
|
39 |
+
## Training Loss:
|
40 |
+
- Below is the training loss.
|
41 |
+
|
42 |
+
| Step | Training Loss |
|
43 |
+
|------|---------------|
|
44 |
+
| 200 | 1.490900 |
|
45 |
+
| 400 | 1.375600 |
|
46 |
+
| 600 | 1.304100 |
|
47 |
+
| 800 | 1.198700 |
|
48 |
+
| 1000 | 1.228200 |
|
49 |
+
| 1125 | 1.226600 |
|
50 |
+
|
51 |
+
|
52 |
+
## About the Creators
|
53 |
+
[Christopher Ibe](https://www.linkedin.com/in/christopher-ibe-ekeocha/) and [Okezie Okoye](https://www.linkedin.com/in/okezie-okoye-43432b62/) continue to lead Hypa AI towards new frontiers in AI translation. Their dedication to leveraging advanced AI for genuine understanding and connection across language barriers is what sets Hypa AI apart in the field of artificial intelligence.
|
54 |
+
|
55 |
+
*Hypa AI* remains steadfast in its mission to pioneer intelligent solutions that are not just technologically advanced but are also culturally aware, ensuring that the future of AI is as diverse and inclusive as the world it serves.
|
56 |
+
|
57 |
+
*AfroVoices*, a subsidiary of Hypa AI, is dedicated to amplifying African voices, languages, and cultures in the intelligence age. Focused on bridging the digital representation gap, AfroVoices curates datasets and resources for African languages, promoting inclusivity and cultural appreciation in AI technologies. Their mission goes beyond technological innovation, aiming to celebrate the richness of African linguistic diversity on a global stage.
|
58 |
+
|