PereLluis13/wav2vec2-xls-r-300m-ca-lm

@@ -8,9 +8,55 @@ tags:
 - collectivat/tv3_parla
 - projecte-aina/parlament_parla
 - generated_from_trainer
 model-index:
-- name: wav2vec2-xls-r-300m-ca
-  results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -19,7 +65,7 @@ should probably proofread and complete it, then remove this comment. -->
 # wav2vec2-xls-r-300m-ca
 This model is a fine-tuned version of [facebook/wav2vec2-xls-r-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) on the MOZILLA-FOUNDATION/COMMON_VOICE_8_0 - CA dataset.
-It achieves the following results on the evaluation set:
 - Loss: 0.2758
 - Wer: 0.1792
@@ -52,7 +98,7 @@ The following hyperparameters were used during training:
 - num_epochs: 6.0
 - mixed_precision_training: Native AMP
-### Training results
 | Training Loss | Epoch | Step  | Validation Loss | Wer    |
 |:-------------:|:-----:|:-----:|:---------------:|:------:|

 - collectivat/tv3_parla
 - projecte-aina/parlament_parla
 - generated_from_trainer
+datasets:
+- mozilla-foundation/common_voice_8_0
+- collectivat/tv3_parla
+- projecte-aina/parlament_parla
 model-index:
+- name: wav2vec2-xls-r-300m-ca-lm
+  results:
+  - task:
+      name: Speech Recognition
+      type: automatic-speech-recognition
+    dataset:
+      name: mozilla-foundation/common_voice_8_0 ca
+      type: mozilla-foundation/common_voice_8_0
+      args: ca
+    metrics:
+       - name: Test WER
+         type: wer
+         value: 0.08108860330598514
+       - name: Test CER
+         type: cer
+         value: 0.027241712812152218
+  - task:
+      name: Speech Recognition
+      type: automatic-speech-recognition
+    dataset:
+      name: projecte-aina/parlament_parla ca
+      type: projecte-aina/parlament_parla
+      args: clean
+    metrics:
+       - name: Test WER
+         type: wer
+         value: 0.06541946111307212
+       - name: Test CER
+         type: cer
+         value: 0.02205785796827398
+  - task:
+      name: Speech Recognition
+      type: automatic-speech-recognition
+    dataset:
+      name: collectivat/tv3_parla ca
+      type: collectivat/tv3_parla
+      args: ca
+    metrics:
+       - name: Test WER
+         type: wer
+         value: 0.1506717480848443
+       - name: Test CER
+         type: cer
+         value: 0.09562445266717665
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 # wav2vec2-xls-r-300m-ca
 This model is a fine-tuned version of [facebook/wav2vec2-xls-r-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) on the MOZILLA-FOUNDATION/COMMON_VOICE_8_0 - CA dataset.
+It achieves the following results on the test set, averaged across datasets:
 - Loss: 0.2758
 - Wer: 0.1792
 - num_epochs: 6.0
 - mixed_precision_training: Native AMP
+### Training results (without LM)
 | Training Loss | Epoch | Step  | Validation Loss | Wer    |
 |:-------------:|:-----:|:-----:|:---------------:|:------:|
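
The metadata added in this change reports Test WER and Test CER for each dataset. As a minimal sketch of what those metrics measure, the snippet below computes both as edit distance divided by reference length, pooled over all utterances. The helper names `edit_distance` and `error_rate` are hypothetical, and the card does not state which tool or text normalization produced its numbers, so this is an illustration rather than the author's evaluation script. Counting spaces as characters for CER is also an assumption.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (strings or token lists)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion from the reference
                            curr[j - 1] + 1,      # insertion into the reference
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def error_rate(references, hypotheses, unit="word"):
    """Pooled error rate: total edits divided by total reference units.

    unit="word" gives WER, unit="char" gives CER (spaces counted, by assumption).
    """
    total_edits = 0
    total_units = 0
    for ref, hyp in zip(references, hypotheses):
        ref_units = ref.split() if unit == "word" else list(ref)
        hyp_units = hyp.split() if unit == "word" else list(hyp)
        total_edits += edit_distance(ref_units, hyp_units)
        total_units += len(ref_units)
    return total_edits / total_units


# Made-up example pair, not taken from any of the datasets above.
refs = ["bon dia a tothom"]
hyps = ["bon dia tothom"]
wer = error_rate(refs, hyps, unit="word")
cer = error_rate(refs, hyps, unit="char")
```

Pooling edits over the whole test set, rather than averaging per-utterance rates, keeps short utterances from dominating the score.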