Update README.md
Browse files
README.md
CHANGED
@@ -7,8 +7,11 @@ license: apache-2.0
|
|
7 |
|
8 |
# Wav2vec2-base for Danish
|
9 |
|
10 |
-
This wav2vec2-base model has been pretrained on ~1300 hours of danish speech data. The pretraining data consists of podcasts and audiobooks and is unfortunately not public available. However, we
|
11 |
|
12 |
The pre-training was done using the fairseq library in January 2021.
|
13 |
|
14 |
-
It needs to be fine-tuned
|
|
|
|
|
|
|
|
7 |
|
8 |
# Wav2vec2-base for Danish
|
9 |
|
10 |
+
This wav2vec2-base model has been pretrained on ~1300 hours of danish speech data. The pretraining data consists of podcasts and audiobooks and is unfortunately not public available. However, we were allowed to distribute the pretrained model.
|
11 |
|
12 |
The pre-training was done using the fairseq library in January 2021.
|
13 |
|
14 |
+
It needs to be fine-tuned to perform speech recognition.
|
15 |
+
|
16 |
+
# Finetuning
|
17 |
+
In order to finetune the model to speech recognition, you can draw inspiration from this [notebook tutorial](https://colab.research.google.com/drive/1FjTsqbYKphl9kL-eILgUc-bl4zVThL8F) or [this blog post tutorial](https://huggingface.co/blog/fine-tune-xlsr-wav2vec2).
|