File size: 3,155 Bytes
2403d40 a718bb7 2403d40 a718bb7 2403d40 a718bb7 2403d40 60d01b0 a718bb7 2403d40 a718bb7 2403d40 a718bb7 2403d40 dc726f3 a718bb7 2403d40 a718bb7 2403d40 a718bb7 2403d40 a718bb7 2403d40 a718bb7 2403d40 a718bb7 2403d40 a718bb7 2403d40 a718bb7 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 |
---
library_name: transformers
license: gpl-3.0
datasets:
- MohamedRashad/arabic-english-code-switching
language:
- ar
- en
metrics:
- wer
pipeline_tag: automatic-speech-recognition
---
# π³ Arabic-Whisper-CodeSwitching-Edition
This model is a fine-tuned version of [Whisper Large v2 by OpenAI](https://huggingface.co/openai/whisper-large-v2), trained on an [Arabic-English-code-switching](https://huggingface.co/datasets/MohamedRashad/arabic-english-code-switching) dataset.
![image/png](https://cdn-uploads.huggingface.co/production/uploads/6116d0584ef9fdfbf45dc4d9/w5AXicC8X3kK1AC30OVmH.png)
## π Model Details
### Model Description
The Arabic-Whisper-CodeSwitching-Edition is designed to handle Arabic audio with embedded English words. This model enhances the original Whisper Large v2 by improving its performance on Arabic-English code-switching speech
- **Developed by:** Ψ§ΩΨΉΨ¨Ψ― ΩΩΩ
- **Model type:** Speech Recognition
- **Language(s) (NLP):** Arabic, English (in the context of Arabic audio)
- **License:** GPL-3.0
### Model Sources [optional]
<!-- Provide the basic links for the model. -->
- **Repository for data collection:** https://github.com/MohamedAliRashad/youtube-audio-collector
- **Demo:** https://huggingface.co/spaces/MohamedRashad/Arabic-Whisper-CodeSwitching-Edition
## π· Uses
<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
### Direct Use
The model can be used directly for transcribing Arabic speech that includes English words. It is particularly useful in multilingual environments where code-switching is common.
### Out-of-Scope Use
The model may not perform well on monolingual speech in languages other than Arabic or English, or on speech with code-switching in languages other than Arabic and English.
## π¨ Bias, Risks, and Limitations
### Recommendations
Users (both direct and downstream) should be made aware of the risks, biases, and limitations of the model. More information needed for further recommendations.
## π How to Get Started with the Model
Use the code below to get started with the model.
```python
from transformers import WhisperForConditionalGeneration, WhisperProcessor
processor = WhisperProcessor.from_pretrained("MohamedRashad/Arabic-Whisper-CodeSwitching-Edition")
model = WhisperForConditionalGeneration.from_pretrained("MohamedRashad/Arabic-Whisper-CodeSwitching-Edition")
# Example usage
inputs = processor("path_to_audio_file.wav", return_tensors="pt")
generated_ids = model.generate(inputs["input_features"])
transcription = processor.batch_decode(generated_ids, skip_special_tokens=True)
print(transcription)
```
## π¨βπ Citation
### BibTeX:
```bibtex
@misc{rashad2024arabicwhisper,
title={Arabic-Whisper-CodeSwitching-Edition},
author={Mohamed Rashad},
year={2024},
url={https://huggingface.co/spaces/MohamedRashad/Arabic-Whisper-CodeSwitching-Edition},
}
```
### APA:
Rashad, M. (2024). Arabic-Whisper-CodeSwitching-Edition. Retrieved from https://huggingface.co/spaces/MohamedRashad/Arabic-Whisper-CodeSwitching-Edition
|