---
title: IPA Transcription Leaderboard
emoji: 📝
colorFrom: indigo
colorTo: blue
sdk: gradio
sdk_version: 5.12.0
app_file: app/app.py
pinned: true
license: agpl-3.0
thumbnail: >-
  https://cdn-uploads.huggingface.co/production/uploads/61dd07bafdc070745eed96fd/QC0vfJ-i0oc77NAM8Fdjs.png
short_description: Speech-to-phoneme leaderboard
---

# 🎯 English Phonemic Transcription Leaderboard

Welcome to the English Phonemic Transcription Leaderboard! This simple leaderboard helps track and compare the performance of different speech-to-phoneme models. Feel free to fork it for your own Hugging Face leaderboards!

![leaderboard](img/leaderboard.png)

## ✨ Features

* 📊 Interactive leaderboard with real-time sorting
* 🔄 Easy model submission system
* 📈 Automatic evaluation of submitted models
* 📱 Responsive design that works on all devices

## 🎯 What This Project Does

This leaderboard tracks two key metrics for phonemic transcription models:

* **PER (Phoneme Error Rate)**: How accurately your model converts speech to phonemes
* **PWED (Phoneme Weighted Edit Distance)**: A more nuanced metric that considers phonemic features

Read more about evaluations on our [blog](https://www.koellabs.com/blog/phonemic-transcription-metrics).

Models are evaluated on the TIMIT speech corpus, a gold standard in speech recognition research.

## 🚀 Getting Started

Navigate to the hosted version on [Hugging Face](https://huggingface.co/spaces/KoelLabs/IPA-Transcription-EN) or follow the instructions in [DEVELOPMENT.md](DEVELOPMENT.md) to run the leaderboard locally.

## 🎮 Using the Leaderboard

### Submitting a Model

1. Go to the "Submit Model" tab
2. Enter your model details:
   * Model name (e.g., "wav2vec2-phoneme-wizard")
   * Submission name (e.g., "MyAwesomeModel v1.0")
   * GitHub/Kaggle/HuggingFace URL (optional)
3. Click Submit and watch your model climb the ranks! 🚀

### Checking Model Status

1. Navigate to the "Model Status" tab
2. Enter your model name or task ID
3. Get real-time updates on your model's evaluation progress

## 📊 Understanding the Results

The leaderboard shows:

* Model names and submission details
* PER and PWED scores (lower is better!)
* Links to model repositories
* Submission dates

Sort by either metric to see who's leading the pack!

## 🛠️ Technical Details

* Built with Gradio for a smooth UI experience
* Runs on a basic compute plan (16 GB RAM, 2 vCPUs) for easy reproducibility
* Evaluation can take several hours - perfect time to grab a coffee ☕

## 🤝 Contributing

Want to make this leaderboard even better? We'd love your help! Here are some ways you can contribute:

* Add new evaluation metrics
* Improve the UI design
* Enhance documentation
* Submit bug fixes
* Add new features

Check out [CONTRIBUTING.md](CONTRIBUTING.md) for more details.

## 📝 License

This project is licensed under the GNU Affero General Public License (AGPL-3.0). We retain all rights to the Koel Labs brand, logos, blog posts and website content.

## 🌟 Acknowledgments

* Thanks to the TIMIT speech corpus for providing evaluation data
* Shoutout to the [panphon library](https://github.com/dmort27/panphon) for PWED calculations
* Built with love by Koel Labs 💙

## 🆘 Need Help?

Got questions? Found a bug? Want to contribute? [Open an issue](https://huggingface.co/spaces/KoelLabs/IPA-Transcription-EN/discussions) or [reach out to us](mailto:info@koellabs.com)!

We're here to help make speech recognition evaluation fun and accessible for everyone!

Remember: Every great model deserves its moment to shine! 🌟

---

Happy Transcribing! 🎤✨
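
For readers curious how the PER score reported on the leaderboard is usually defined, here is a minimal sketch. It assumes the common definition (Levenshtein edit distance between the reference and predicted phoneme sequences, divided by the reference length). The function names `edit_distance` and `phoneme_error_rate` are hypothetical and chosen for illustration; the leaderboard's own evaluation code may normalize or aggregate differently.

```python
from typing import Sequence


def edit_distance(ref: Sequence[str], hyp: Sequence[str]) -> int:
    """Levenshtein distance between two phoneme sequences (illustrative helper)."""
    # prev[j] holds the distance between the first i-1 reference phonemes
    # and the first j hypothesis phonemes.
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(
                min(
                    prev[j] + 1,         # deletion of a reference phoneme
                    curr[j - 1] + 1,     # insertion of a hypothesis phoneme
                    prev[j - 1] + cost,  # substitution, or match when cost is 0
                )
            )
        prev = curr
    return prev[-1]


def phoneme_error_rate(ref: Sequence[str], hyp: Sequence[str]) -> float:
    """Edit distance divided by reference length (assumed PER definition)."""
    if not ref:
        raise ValueError("reference sequence must not be empty")
    return edit_distance(ref, hyp) / len(ref)


if __name__ == "__main__":
    reference = ["k", "æ", "t"]
    predicted = ["k", "ʌ", "t"]
    print(f"PER: {phoneme_error_rate(reference, predicted):.3f}")
```

Under this definition a perfect transcription scores 0, and the score can exceed 1 when the prediction contains many extra phonemes, which is why lower is better.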