speech31 committed on
Commit
d961bd0
·
verified ·
1 Parent(s): c2af6a8

Update README.md

  1. README.md +15 -4
README.md CHANGED
@@ -1,12 +1,23 @@
  This model is fine-tuned on the Tamil dataset from Common Voice 16.1, preprocessed using Epitran for transliterating text into IPA. The 'tam-Taml' code was employed to generate a precise phoneme list, crucial for capturing the nuances of Tamil phonetics:
 
- * Vowels: 'a', 'aj', 'aʋ', 'aː', 'e', 'eː', 'i', 'iː', 'o', 'oː', 'u', 'uː'
  * Consonants:
  * Nasals: 'm', 'n', 'n̪', 'ŋ', 'ɲ', 'ɳ'
  * Stops: 'p', 't̪', 'ʈ', 'k',
- * Affricates: 'd͡ʒ', 't͡ʃ'
  * Fricatives: 'ʋ', 's', 'ʂ', 'ʃ', 'h'
  * Approximants: 'j', 'ɻ', 'ɾ', 'l', 'ɭ'
  * Consonant cluster: 'kʂ'
- * Special Symbols: '்' (denotes absence of inherent vowel)
-
 
+ ---
+ datasets:
+ - mozilla-foundation/common_voice_16_1
+ language:
+ - ta
+ metrics:
+ - wer
+ pipeline_tag: automatic-speech-recognition
+ ---
  This model is fine-tuned on the Tamil dataset from Common Voice 16.1, preprocessed using Epitran for transliterating text into IPA. The 'tam-Taml' code was employed to generate a precise phoneme list, crucial for capturing the nuances of Tamil phonetics:
 
+ * Vowels:
+ * Monophthongs:'a', 'aː', 'e', 'eː', 'i', 'iː', 'o', 'oː', 'u', 'uː'
+ * Diphthongs: 'aj', 'aʋ'
+
  * Consonants:
  * Nasals: 'm', 'n', 'n̪', 'ŋ', 'ɲ', 'ɳ'
  * Stops: 'p', 't̪', 'ʈ', 'k',
+ * Affricates: 't͡ʃ', 'd͡ʒ'
  * Fricatives: 'ʋ', 's', 'ʂ', 'ʃ', 'h'
  * Approximants: 'j', 'ɻ', 'ɾ', 'l', 'ɭ'
  * Consonant cluster: 'kʂ'
+ * Special Symbols: '்' (denotes absence of inherent vowel)
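
The phoneme list in the updated README can be turned into a small lookup for splitting an IPA string into model tokens. The following is a minimal sketch, not the author's preprocessing code: the names `PHONEMES`, `INVENTORY`, and `tokenize` are hypothetical, and the greedy longest-match strategy is an assumption about how multi-character symbols such as 'aː', 't͡ʃ', or 'kʂ' might be kept intact. It uses only the standard library and does not call Epitran itself.

```python
# Phoneme inventory copied from the README; the grouping keys are illustrative.
PHONEMES = {
    "monophthongs": ["a", "aː", "e", "eː", "i", "iː", "o", "oː", "u", "uː"],
    "diphthongs": ["aj", "aʋ"],
    "nasals": ["m", "n", "n̪", "ŋ", "ɲ", "ɳ"],
    "stops": ["p", "t̪", "ʈ", "k"],
    "affricates": ["t͡ʃ", "d͡ʒ"],
    "fricatives": ["ʋ", "s", "ʂ", "ʃ", "h"],
    "approximants": ["j", "ɻ", "ɾ", "l", "ɭ"],
    "clusters": ["kʂ"],
    # Tamil pulli (U+0BCD), listed in the README as a special symbol.
    "special": ["\u0bcd"],
}

# Flatten and sort longest-first so multi-character symbols win over their prefixes.
INVENTORY = sorted(
    {p for group in PHONEMES.values() for p in group}, key=len, reverse=True
)


def tokenize(ipa: str) -> list[str]:
    """Split an IPA string into inventory symbols using greedy longest match."""
    tokens = []
    i = 0
    while i < len(ipa):
        for p in INVENTORY:
            if ipa.startswith(p, i):
                tokens.append(p)
                i += len(p)
                break
        else:
            # No inventory symbol starts here; fail loudly rather than guess.
            raise ValueError(f"symbol not in inventory at index {i}: {ipa[i]!r}")
    return tokens
```

For example, `tokenize("ammaː")` returns `['a', 'm', 'm', 'aː']`. Greedy matching always prefers the longer symbol, so a sequence like 'a' followed by 'j' is read as the diphthong 'aj'; whether that matches the intended segmentation depends on how the training labels were actually built.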