File size: 1,588 Bytes

026cbb8

---
license: apache-2.0
datasets:
- mozilla-foundation/common_voice_11_0
language:
- ur
base_model:
- openai/whisper-medium
pipeline_tag: automatic-speech-recognition
library_name: transformers
---


# Whisper Medium Urdu Model

This model is a fine-tuned version of OpenAI's Whisper model for automatic speech recognition (ASR) in **Urdu**. It is trained on various audio datasets and is designed to convert spoken Urdu language into text.

## Model Description

The **Whisper** model is a general-purpose ASR system trained on a large multilingual dataset, capable of transcribing speech to text in many languages, including Urdu. This specific model has been fine-tuned on Urdu audio datasets for better accuracy with Urdu speech inputs.

### Key Features:
- **Language:** Urdu
- **Model Type:** Whisper medium model
- **Task:** Automatic Speech Recognition (ASR)
- **Training Data:** The model was trained on a diverse set of Urdu speech data.

## Intended Use

This model is intended for automatic transcription of Urdu speech to text. It can be used for applications such as:
- Speech-to-text transcription in Urdu
- Transcription for Urdu audio or video content
- Accessibility features for Urdu-speaking users

## How to Use

You can easily use the model with the Hugging Face `transformers` library:

```python
from transformers import AutoProcessor, AutoModelForSpeechSeq2Seq

# Load the model and processor
processor = AutoProcessor.from_pretrained("Abdul145/whisper-medium-urdu-custom")
model = AutoModelForSpeechSeq2Seq.from_pretrained("Abdul145/whisper-medium-urdu-custom")