adrianSauer
/

wav2vec2-cer-extension

Automatic Speech Recognition

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

adrianSauer commited on Aug 20, 2024

Commit

630a7b7

·

verified ·

1 Parent(s): 6c3973c

End of training

Files changed (1) hide show

README.md +13 -11

README.md CHANGED Viewed

@@ -1,4 +1,5 @@
 ---
 language:
 - gn
 license: apache-2.0
@@ -19,8 +20,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [glob-asr/wav2vec2-large-xls-r-300m-guarani-small](https://huggingface.co/glob-asr/wav2vec2-large-xls-r-300m-guarani-small) on the Common Voice 16 dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.3309
-- Cer: 7.5608
 ## Model description
@@ -39,7 +40,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 1e-05
 - train_batch_size: 8
 - eval_batch_size: 16
 - seed: 42
@@ -47,24 +48,25 @@ The following hyperparameters were used during training:
 - total_train_batch_size: 16
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: constant_with_warmup
-- lr_scheduler_warmup_steps: 50
-- training_steps: 500
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch  | Step | Validation Loss | Cer    |
 |:-------------:|:------:|:----:|:---------------:|:------:|
-| 1.3968        | 0.0991 | 100  | 0.3683          | 8.4273 |
-| 1.061         | 0.1982 | 200  | 0.3611          | 8.5093 |
-| 1.0374        | 0.2973 | 300  | 0.3500          | 8.1463 |
-| 0.9825        | 0.3964 | 400  | 0.3458          | 7.9394 |
-| 0.9185        | 0.4955 | 500  | 0.3309          | 7.5608 |
 ### Framework versions
-- Transformers 4.44.0
 - Pytorch 2.3.1+cu121
 - Datasets 2.21.0
 - Tokenizers 0.19.1

 ---
+library_name: transformers
 language:
 - gn
 license: apache-2.0
 This model is a fine-tuned version of [glob-asr/wav2vec2-large-xls-r-300m-guarani-small](https://huggingface.co/glob-asr/wav2vec2-large-xls-r-300m-guarani-small) on the Common Voice 16 dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.2425
+- Cer: 5.9170
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 5e-05
 - train_batch_size: 8
 - eval_batch_size: 16
 - seed: 42
 - total_train_batch_size: 16
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: constant_with_warmup
+- lr_scheduler_warmup_steps: 3000
+- training_steps: 3000
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch  | Step | Validation Loss | Cer    |
 |:-------------:|:------:|:----:|:---------------:|:------:|
+| 1.2573        | 0.4955 | 500  | 0.3703          | 8.4904 |
+| 0.9205        | 0.9911 | 1000 | 0.3224          | 7.6296 |
+| 0.7466        | 1.4866 | 1500 | 0.2938          | 7.1221 |
+| 0.6766        | 1.9822 | 2000 | 0.2715          | 6.6510 |
+| 0.5782        | 2.4777 | 2500 | 0.2831          | 7.0497 |
+| 0.5495        | 2.9732 | 3000 | 0.2425          | 5.9170 |
 ### Framework versions
+- Transformers 4.44.1
 - Pytorch 2.3.1+cu121
 - Datasets 2.21.0
 - Tokenizers 0.19.1