Spaces:
Running
title: Ukrainian TTS
emoji: π
colorFrom: blue
colorTo: yellow
sdk: gradio
sdk_version: 3.3
python_version: 3.9
app_file: app.py
pinned: false
Ukrainian TTS π’π€
Ukrainian TTS (text-to-speech) using Coqui TTS.
Link to online demo -> https://huggingface.co/spaces/robinhad/ukrainian-tts Link to source code and models -> https://github.com/robinhad/ukrainian-tts
Code is licensed under MIT License
, models are under GNU GPL v3 License
.
Support
If you like my work, please support -> https://send.monobank.ua/jar/48iHq4xAXm
For collaboration and question please contact me here: Telegram https://t.me/robinhad Twitter https://twitter.com/robinhad
You're welcome to join UA Speech Recognition and Synthesis community: Telegram https://t.me/speech_recognition_uk
Examples
Mykyta (male)
:
https://user-images.githubusercontent.com/5759207/178158485-29a5d496-7eeb-4938-8ea7-c345bc9fed57.mp4
Olena (female)
:
https://user-images.githubusercontent.com/5759207/178158492-8504080e-2f13-43f1-83f0-489b1f9cd66b.mp4
How to use:
pip install -r requirements.txt
.- Download model from "Releases" tab.
- Launch as one-time command:
tts --text "Text for TTS" \
--model_path path/to/model.pth \
--config_path path/to/config.json \
--out_path folder/to/save/output.wav
or alternatively launch web server using:
tts-server --model_path path/to/model.pth \
--config_path path/to/config.json
How to train: ποΈ
- Refer to "Nervous beginner guide" in Coqui TTS docs.
- Instead of provided
config.json
use one from this repo.
Attribution π€
- Model training - Yurii Paniv @robinhad
- Mykyta, Olena, Lada, Dmytro, Olha dataset - Yehor Smoliakov @egorsmkv
- Dmytro voice - Dmytro Chaplynskyi @dchaplinsky
- Silence cutting using HMM-GMM - Volodymyr Kyrylov @proger
- Autostress (with dictionary) using ukrainian-word-stress - Oleksiy Syvokon @asivokon
- Autostress (with model) using ukrainian-accentor - Bohdan Mykhailenko @NeonBohdan + Yehor Smoliakov @egorsmkv