snu-nia-12
/

wav2vec2-xls-r-300m_nia12_phone-hiragana_japanese

Automatic Speech Recognition

Inference Endpoints

Model card Files Files and versions Community

snu-nia-12 commited on Jan 11, 2023

Commit

dd84e33

·

1 Parent(s): d335a69

Create README.md

Files changed (1) hide show

README.md +29 -0

README.md ADDED Viewed

	@@ -0,0 +1,29 @@

+---
+language: ja
+datasets:
+- common_voice
+metrics:
+- cer
+model-index:
+- name: wav2vec2-xls-r-300m finetuned on Japanese Hiragana with no word boundaries
+  results:
+  - task:
+      name: Speech Recognition
+      type: automatic-speech-recognition
+    dataset:
+      name: Common Voice Japanese
+      type: common_voice
+      args: ja
+    metrics:
+       - name: Test CER
+         type: cer
+         value: 9.34
+---
+# Wav2Vec2-XLS-R-300M-Japanese-Hiragana
+Fine-tuned [facebook/wav2vec2-xls-r-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) on Japanese Hiragana characters using JSUT, JVS, Common Voice, and in-house dataset.
+The sentence outputs do not contain word boundaries. Audio inputs should be sampled at 16kHz.
+## Test Results
+**CER:** 9.34%
+## Training
+Trained on JSUT, a subset of JVS, train+valid set of Common Voice Japanese, and in-house Japanese dataset. Tested on test set of Common Voice Japanese.