|
--- |
|
license: apache-2.0 |
|
datasets: |
|
- mozilla-foundation/common_voice_11_0 |
|
language: |
|
- ur |
|
base_model: |
|
- openai/whisper-medium |
|
pipeline_tag: automatic-speech-recognition |
|
library_name: transformers |
|
--- |
|
|
|
|
|
# Whisper Medium Urdu Model |
|
|
|
This model is a fine-tuned version of OpenAI's Whisper model for automatic speech recognition (ASR) in **Urdu**. It is trained on various audio datasets and is designed to convert spoken Urdu language into text. |
|
|
|
## Model Description |
|
|
|
The **Whisper** model is a general-purpose ASR system trained on a large multilingual dataset, capable of transcribing speech to text in many languages, including Urdu. This specific model has been fine-tuned on Urdu audio datasets for better accuracy with Urdu speech inputs. |
|
|
|
### Key Features: |
|
- **Language:** Urdu |
|
- **Model Type:** Whisper medium model |
|
- **Task:** Automatic Speech Recognition (ASR) |
|
- **Training Data:** The model was trained on a diverse set of Urdu speech data. |
|
|
|
## Intended Use |
|
|
|
This model is intended for automatic transcription of Urdu speech to text. It can be used for applications such as: |
|
- Speech-to-text transcription in Urdu |
|
- Transcription for Urdu audio or video content |
|
- Accessibility features for Urdu-speaking users |
|
|
|
## How to Use |
|
|
|
You can easily use the model with the Hugging Face `transformers` library: |
|
|
|
```python |
|
from transformers import AutoProcessor, AutoModelForSpeechSeq2Seq |
|
|
|
# Load the model and processor |
|
processor = AutoProcessor.from_pretrained("Abdul145/whisper-medium-urdu-custom") |
|
model = AutoModelForSpeechSeq2Seq.from_pretrained("Abdul145/whisper-medium-urdu-custom") |
|
|