Update README.md
README.md
CHANGED
@@ -1,12 +1,23 @@
+---
+datasets:
+- mozilla-foundation/common_voice_16_1
+language:
+- ta
+metrics:
+- wer
+pipeline_tag: automatic-speech-recognition
+---
 This model is fine-tuned on the Tamil dataset from Common Voice 16.1, preprocessed using Epitran for transliterating text into IPA. The 'tam-Taml' code was employed to generate a precise phoneme list, crucial for capturing the nuances of Tamil phonetics:
 
-* Vowels:
+* Vowels:
+* Monophthongs:'a', 'aː', 'e', 'eː', 'i', 'iː', 'o', 'oː', 'u', 'uː'
+* Diphthongs: 'aj', 'aʋ'
+
 * Consonants:
 * Nasals: 'm', 'n', 'n̪', 'ŋ', 'ɲ', 'ɳ'
 * Stops: 'p', 't̪', 'ʈ', 'k',
-* Affricates:
+* Affricates: 't͡ʃ', 'd͡ʒ'
 * Fricatives: 'ʋ', 's', 'ʂ', 'ʃ', 'h'
 * Approximants: 'j', 'ɻ', 'ɾ', 'l', 'ɭ'
 * Consonant cluster: 'kʂ'
-* Special Symbols: '்' (denotes absence of inherent vowel)
-
+* Special Symbols: '்' (denotes absence of inherent vowel)