ales/whisper-tiny-be-test

@@ -1,41 +1,38 @@
 ---
-language:
-- be
 license: apache-2.0
 tags:
-- whisper-event
 - generated_from_trainer
 datasets:
-- mozilla-foundation/common_voice_11_0
+- common_voice_11_0
 metrics:
 - wer
 model-index:
-- name: Whisper Tiny Belarusian
+- name: whisper-tiny-be-test
   results:
   - task:
       name: Automatic Speech Recognition
       type: automatic-speech-recognition
     dataset:
-      name: mozilla-foundation/common_voice_11_0 be
-      type: mozilla-foundation/common_voice_11_0
+      name: common_voice_11_0
+      type: common_voice_11_0
       config: be
       split: validation
       args: be
     metrics:
     - name: Wer
       type: wer
-      value: 60.07326007326007
+      value: 51.46520146520146
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# Whisper Tiny Belarusian
-This model is a fine-tuned version of [openai/whisper-tiny](https://huggingface.co/openai/whisper-tiny) on the mozilla-foundation/common_voice_11_0 be dataset.
+# whisper-tiny-be-test
+This model is a fine-tuned version of [openai/whisper-tiny](https://huggingface.co/openai/whisper-tiny) on the common_voice_11_0 dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.6389
-- Wer: 60.0733
+- Loss: 0.4624
+- Wer: 51.4652
 ## Model description
@@ -61,23 +58,33 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 10
-- training_steps: 100
+- training_steps: 200
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Wer     |
 |:-------------:|:-----:|:----:|:---------------:|:-------:|
-| 2.5622        | 0.1   | 10   | 1.5402          | 94.5055 |
-| 1.3719        | 0.2   | 20   | 1.0012          | 75.2747 |
-| 0.9898        | 0.3   | 30   | 0.8217          | 72.7106 |
-| 0.9742        | 0.4   | 40   | 0.7924          | 72.5275 |
-| 0.6951        | 0.5   | 50   | 0.7628          | 76.1905 |
-| 0.7824        | 0.6   | 60   | 0.6738          | 65.3846 |
-| 0.6818        | 0.7   | 70   | 0.6389          | 60.0733 |
-| 0.7823        | 0.8   | 80   | 0.6208          | 65.7509 |
-| 0.5994        | 0.9   | 90   | 0.5901          | 61.9048 |
-| 0.6647        | 1.0   | 100  | 0.5790          | 61.7216 |
+| 2.5366        | 0.05  | 10   | 1.5402          | 94.5055 |
+| 1.3721        | 0.1   | 20   | 1.0021          | 75.8242 |
+| 0.9921        | 0.15  | 30   | 0.8322          | 75.0916 |
+| 0.9844        | 0.2   | 40   | 0.8080          | 72.8938 |
+| 0.7071        | 0.25  | 50   | 0.7862          | 77.2894 |
+| 0.7998        | 0.3   | 60   | 0.7052          | 68.8645 |
+| 0.6935        | 0.35  | 70   | 0.6781          | 64.2857 |
+| 0.81          | 0.4   | 80   | 0.6341          | 63.5531 |
+| 0.6133        | 0.45  | 90   | 0.6083          | 62.6374 |
+| 0.6675        | 0.5   | 100  | 0.5851          | 62.8205 |
+| 0.5577        | 0.55  | 110  | 0.5651          | 59.3407 |
+| 0.6473        | 0.6   | 120  | 0.5638          | 58.0586 |
+| 0.6018        | 0.65  | 130  | 0.5434          | 53.8462 |
+| 0.5918        | 0.7   | 140  | 0.5385          | 54.9451 |
+| 0.5654        | 0.75  | 150  | 0.5200          | 58.0586 |
+| 0.587         | 0.8   | 160  | 0.4974          | 57.1429 |
+| 0.6157        | 0.85  | 170  | 0.4834          | 53.2967 |
+| 0.6803        | 0.9   | 180  | 0.4852          | 55.8608 |
+| 0.4813        | 0.95  | 190  | 0.4686          | 51.2821 |
+| 0.4952        | 1.0   | 200  | 0.4624          | 51.4652 |
 ### Framework versions
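
The `Wer` column in the model card is a word error rate expressed as a percentage. As a rough illustration of what that number measures, here is a minimal sketch that computes WER for a single reference/hypothesis pair using word-level edit distance. The function name `word_error_rate` is hypothetical, and this is not necessarily how the Trainer computed the reported values: the actual evaluation may normalize text differently and aggregate over the whole validation split.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER in percent: word-level edit distance / reference length.

    Hypothetical helper for illustration only; whitespace tokenization
    and no text normalization are simplifying assumptions.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # turning i reference words into an empty hypothesis costs i deletions
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr

    return 100.0 * prev[-1] / len(ref)
```

Because insertions count as errors, WER can exceed 100 when the hypothesis is much longer than the reference.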

train.log

@@ -207,3 +207,5 @@
 {'loss': 0.4813, 'learning_rate': 6.842105263157896e-06, 'epoch': 0.95}
 {'eval_loss': 0.4685819447040558, 'eval_wer': 51.28205128205128, 'eval_runtime': 17.9367, 'eval_samples_per_second': 3.568, 'eval_steps_per_second': 0.112, 'epoch': 0.95}
 {'loss': 0.4952, 'learning_rate': 1.5789473684210528e-06, 'epoch': 1.0}
+{'eval_loss': 0.4624484181404114, 'eval_wer': 51.46520146520146, 'eval_runtime': 19.165, 'eval_samples_per_second': 3.339, 'eval_steps_per_second': 0.104, 'epoch': 1.0}
+{'train_runtime': 2053.4009, 'train_samples_per_second': 3.117, 'train_steps_per_second': 0.097, 'train_loss': 0.8012711083889008, 'epoch': 1.0}
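
Each record in `train.log` above is printed as a Python dict literal, so the log can be read back with `ast.literal_eval`. The sketch below parses such lines and picks the evaluation record with the lowest `eval_wer`. The helper names `parse_log_lines` and `best_eval` are hypothetical, and the sample lines are copied from the hunk above rather than read from the full log.

```python
import ast


def parse_log_lines(lines):
    """Parse dict-literal log lines into dicts, skipping anything else."""
    records = []
    for line in lines:
        # Drop a leading diff marker ("+" or " ") if the text came from a diff.
        text = line.lstrip("+ ").strip()
        if not (text.startswith("{") and text.endswith("}")):
            continue
        try:
            record = ast.literal_eval(text)
        except (ValueError, SyntaxError):
            continue  # not a valid Python literal; ignore the line
        if isinstance(record, dict):
            records.append(record)
    return records


def best_eval(records, key="eval_wer"):
    """Return the record with the smallest value for `key`, or None."""
    evals = [r for r in records if key in r]
    return min(evals, key=lambda r: r[key]) if evals else None


sample_lines = [
    " {'loss': 0.4813, 'learning_rate': 6.842105263157896e-06, 'epoch': 0.95}",
    " {'eval_loss': 0.4685819447040558, 'eval_wer': 51.28205128205128, 'eval_runtime': 17.9367, 'eval_samples_per_second': 3.568, 'eval_steps_per_second': 0.112, 'epoch': 0.95}",
    " {'loss': 0.4952, 'learning_rate': 1.5789473684210528e-06, 'epoch': 1.0}",
    "+{'eval_loss': 0.4624484181404114, 'eval_wer': 51.46520146520146, 'eval_runtime': 19.165, 'eval_samples_per_second': 3.339, 'eval_steps_per_second': 0.104, 'epoch': 1.0}",
    "+{'train_runtime': 2053.4009, 'train_samples_per_second': 3.117, 'train_steps_per_second': 0.097, 'train_loss': 0.8012711083889008, 'epoch': 1.0}",
]

records = parse_log_lines(sample_lines)
best = best_eval(records)
print(best["epoch"], best["eval_wer"])
```

On these five lines the lowest `eval_wer` belongs to the epoch 0.95 evaluation, while the model card reports the final evaluation at epoch 1.0.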