Commit 5a6e371 by vumichien · 1 Parent(s): e334640

Update README.md

Files changed (1)
  1. README.md +64 -26
README.md CHANGED
@@ -1,38 +1,76 @@
1
  ---
 
2
  language:
3
  - ja
4
- license: apache-2.0
5
  tags:
6
  - automatic-speech-recognition
7
- - vumichien/common_voice_large_jsut_jsss_css10
8
- - generated_from_trainer
 
9
  model-index:
10
- - name: wav2vec2-xls-r-1b-ja-dumy8
11
- results: []
 
12
  ---
13
-
14
- <!-- This model card has been generated automatically according to the information the Trainer had access to. You
15
- should probably proofread and complete it, then remove this comment. -->
16
-
17
- # wav2vec2-xls-r-1b-ja-dumy8
18
-
19
- This model is a fine-tuned version of [facebook/wav2vec2-xls-r-1b](https://huggingface.co/facebook/wav2vec2-xls-r-1b) on the VUMICHIEN/COMMON_VOICE_LARGE_JSUT_JSSS_CSS10 - JA dataset.
20
- It achieves the following results on the evaluation set:
21
- - Loss: 0.2104
22
- - Wer: 0.1941
23
- - Cer: 0.0991
24
-
25
  ## Model description
26
 
27
- More information needed
28
-
29
- ## Intended uses & limitations
30
-
31
- More information needed
32
-
33
- ## Training and evaluation data
34
-
35
- More information needed
 
36
 
37
  ## Training procedure
38
 
 
1
  ---
2
+ license: apache-2.0
3
  language:
4
  - ja
 
5
  tags:
6
  - automatic-speech-recognition
7
+ - robust-speech-event
8
+ - common-voice
9
+ - ja
10
  model-index:
11
+ - name: wav2vec2-xls-r-1b
12
+ results:
13
+ - task:
14
+ name: Speech Recognition
15
+ type: automatic-speech-recognition
16
+ dataset:
17
+ name: Common Voice 7.0
18
+ type: mozilla-foundation/common_voice_7_0
19
+ args: ja
20
+ metrics:
21
+ - name: Test WER (with LM)
22
+ type: wer
23
+ value: 7.98
24
+ - name: Test CER (with LM)
25
+ type: cer
26
+ value: 3.42
27
+ - task:
28
+ name: Speech Recognition
29
+ type: automatic-speech-recognition
30
+ dataset:
31
+ name: Common Voice 8.0
32
+ type: mozilla-foundation/common_voice_8_0
33
+ args: ja
34
+ metrics:
35
+ - name: Test WER (with LM)
36
+ type: wer
37
+ value: 7.88
38
+ - name: Test CER (with LM)
39
+ type: cer
40
+ value: 3.35
41
+ - task:
42
+ name: Speech Recognition
43
+ type: automatic-speech-recognition
44
+ dataset:
45
+ name: Robust Speech Event - Dev Data
46
+ type: speech-recognition-community-v2/dev_data
47
+ args: ja
48
+ metrics:
49
+ - name: Test WER (with LM)
50
+ type: wer
51
+ value: 28.07
52
+ - name: Test CER (with LM)
53
+ type: cer
54
+ value: 16.27
55
  ---
 
56
  ## Model description
57
 
58
+ This model is a fine-tuned version of [facebook/wav2vec2-xls-r-1b](https://huggingface.co/facebook/wav2vec2-xls-r-1b) on my collection of public Japanese speech datasets for research, VUMICHIEN/COMMON_VOICE_LARGE_JSUT_JSSS_CSS10.
59
+ ### Benchmark WER results
60
+ | | [COMMON VOICE 7.0](https://huggingface.co/datasets/mozilla-foundation/common_voice_7_0) | [COMMON VOICE 8.0](https://huggingface.co/datasets/mozilla-foundation/common_voice_8_0) |
61
+ |---|---|---|
62
+ |without LM| 10.96 | 10.91 |
63
+ |with 4-gram LM| 7.98 | 7.88 |
64
+ ### Benchmark CER results
65
+ | | [COMMON VOICE 7.0](https://huggingface.co/datasets/mozilla-foundation/common_voice_7_0) | [COMMON VOICE 8.0](https://huggingface.co/datasets/mozilla-foundation/common_voice_8_0) |
66
+ |---|---|---|
67
+ |without LM| 4.28 | 4.22 |
68
+ |with 4-gram LM| 3.35 |
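To make the effect of the language model easier to read off the two tables above, the following sketch computes the relative error reduction from adding the 4-gram LM. The scores are copied from the tables; the `scores` dictionary and the `relative_reduction` helper are illustrative names, not part of the repository.

```python
# Scores copied from the benchmark tables above: (without LM, with 4-gram LM).
scores = {
    ("WER", "Common Voice 7.0"): (10.96, 7.98),
    ("WER", "Common Voice 8.0"): (10.91, 7.88),
    ("CER", "Common Voice 7.0"): (4.28, 3.42),
    ("CER", "Common Voice 8.0"): (4.22, 3.35),
}


def relative_reduction(without_lm: float, with_lm: float) -> float:
    """Relative error reduction, in percent, gained by adding the LM."""
    return 100.0 * (without_lm - with_lm) / without_lm


for (metric, dataset), (base, lm) in scores.items():
    print(f"{metric} on {dataset}: {relative_reduction(base, lm):.1f}% relative reduction")
```

On these numbers the LM removes roughly 27 to 28% of the word errors and about 20 to 21% of the character errors.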
69
+ ## Evaluation
70
+ Please use the eval.py file to run the evaluation:
71
+ ```bash
72
+ python eval.py --model_id vutankiet2901/wav2vec2-large-xlsr-53-ja --dataset mozilla-foundation/common_voice_7_0 --config ja --split test --log_outputs
73
+ ```
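The eval.py script itself is not shown in this diff. As a rough illustration of what the reported WER and CER measure, here is a minimal pure-Python sketch based on edit distance. It assumes the transcripts are already segmented into space-separated words (Japanese text has no spaces, so a real evaluation needs a word segmenter) and that CER ignores spaces; both are assumptions, and the function names are hypothetical.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences of tokens."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # deletion of a reference token
                curr[j - 1] + 1,     # insertion of a hypothesis token
                prev[j - 1] + cost,  # substitution (or match when cost is 0)
            ))
        prev = curr
    return prev[-1]


def error_rate(references, hypotheses, unit="word"):
    """Corpus-level error rate in percent: total edits / total reference length."""
    total_edits = 0
    total_len = 0
    for ref, hyp in zip(references, hypotheses):
        if unit == "char":
            # Assumption: spaces are dropped before comparing characters.
            r, h = list(ref.replace(" ", "")), list(hyp.replace(" ", ""))
        else:
            # Assumption: words are already separated by spaces.
            r, h = ref.split(), hyp.split()
        total_edits += edit_distance(r, h)
        total_len += len(r)
    return 100.0 * total_edits / total_len


refs = ["今日 は 晴れ です"]
hyps = ["今日 は 雨 です"]
wer = error_rate(refs, hyps, unit="word")
cer = error_rate(refs, hyps, unit="char")
print(f"WER: {wer:.2f}  CER: {cer:.2f}")
```

In this toy pair one of four words is wrong, while at the character level the two-character word is replaced by a one-character word, which counts as two edits over seven reference characters.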
74
 
75
  ## Training procedure
76