speech31/XLS-R-tamil-phoneme

Automatic Speech Recognition

speech31 committed on Feb 21, 2024

Commit

d961bd0

1 Parent(s): c2af6a8

Update README.md

Files changed (1)

README.md +15 -4

@@ -1,12 +1,23 @@
 This model is fine-tuned on the Tamil dataset from Common Voice 16.1, preprocessed using Epitran for transliterating text into IPA. The 'tam-Taml' code was employed to generate a precise phoneme list, crucial for capturing the nuances of Tamil phonetics:
-* Vowels: 'a', 'aj', 'aʋ', 'aː', 'e', 'eː', 'i', 'iː', 'o', 'oː', 'u', 'uː'
 * Consonants:
   * Nasals: 'm', 'n', 'n̪', 'ŋ', 'ɲ', 'ɳ'
   * Stops: 'p', 't̪', 'ʈ', 'k',
-  * Affricates: 'd͡ʒ', 't͡ʃ'
   * Fricatives: 'ʋ', 's', 'ʂ', 'ʃ', 'h'
   * Approximants: 'j', 'ɻ', 'ɾ', 'l', 'ɭ'
   * Consonant cluster: 'kʂ'
-* Special Symbols: '்' (denotes absence of inherent vowel)

+---
+datasets:
+- mozilla-foundation/common_voice_16_1
+language:
+- ta
+metrics:
+- wer
+pipeline_tag: automatic-speech-recognition
+---
 This model is fine-tuned on the Tamil dataset from Common Voice 16.1, preprocessed using Epitran for transliterating text into IPA. The 'tam-Taml' code was employed to generate a precise phoneme list, crucial for capturing the nuances of Tamil phonetics:
+* Vowels:
+  * Monophthongs: 'a', 'aː', 'e', 'eː', 'i', 'iː', 'o', 'oː', 'u', 'uː'
+  * Diphthongs: 'aj', 'aʋ'
 * Consonants:
   * Nasals: 'm', 'n', 'n̪', 'ŋ', 'ɲ', 'ɳ'
   * Stops: 'p', 't̪', 'ʈ', 'k',
+  * Affricates:  't͡ʃ', 'd͡ʒ'
   * Fricatives: 'ʋ', 's', 'ʂ', 'ʃ', 'h'
   * Approximants: 'j', 'ɻ', 'ɾ', 'l', 'ɭ'
   * Consonant cluster: 'kʂ'
+* Special Symbols: '்' (denotes absence of inherent vowel)
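
To illustrate how the phoneme list in the updated README could be used, here is a minimal Python sketch that splits an IPA string into symbols from that inventory. The function name `tokenize_ipa` and the greedy longest-match strategy are assumptions made for illustration. The README does not describe the model's actual tokenizer, and the sketch does not call Epitran itself; it only consumes IPA text of the kind Epitran's 'tam-Taml' mapping would produce.

```python
import unicodedata

# Phoneme inventory copied from the README's list.
# Multi-character symbols (long vowels, diphthongs, affricates, 'kʂ')
# must be matched before their single-character prefixes.
PHONEMES = [
    "a", "aː", "e", "eː", "i", "iː", "o", "oː", "u", "uː",  # monophthongs
    "aj", "aʋ",                                              # diphthongs
    "m", "n", "n̪", "ŋ", "ɲ", "ɳ",                            # nasals
    "p", "t̪", "ʈ", "k",                                      # stops
    "t͡ʃ", "d͡ʒ",                                              # affricates
    "ʋ", "s", "ʂ", "ʃ", "h",                                 # fricatives
    "j", "ɻ", "ɾ", "l", "ɭ",                                 # approximants
    "kʂ",                                                    # consonant cluster
    "்",                                                      # special symbol
]


def tokenize_ipa(text, inventory=PHONEMES):
    """Split an IPA string into inventory symbols (hypothetical helper).

    Uses greedy longest-match, so 'aj' is read as the diphthong rather
    than 'a' followed by 'j'. Whitespace is skipped. Raises ValueError
    on any character sequence that is not in the inventory.
    """
    # Normalize so combining marks compare consistently on both sides.
    symbols = sorted(
        {unicodedata.normalize("NFC", p) for p in inventory},
        key=len,
        reverse=True,
    )
    text = unicodedata.normalize("NFC", text)

    tokens = []
    i = 0
    while i < len(text):
        if text[i].isspace():
            i += 1
            continue
        for sym in symbols:
            if text.startswith(sym, i):
                tokens.append(sym)
                i += len(sym)
                break
        else:
            raise ValueError(f"unknown symbol {text[i]!r} at position {i}")
    return tokens


if __name__ == "__main__":
    print(tokenize_ipa("ammaː"))
```

Greedy matching is only one possible design choice: it cannot tell a true diphthong 'aj' from the sequence 'a' + 'j' across a syllable boundary, so a real pipeline may need a different segmentation rule.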