Spaces:
Sleeping
Sleeping
title: ai-audio-books | |
emoji: ππ¨βπ»π§ | |
colorFrom: blue | |
colorTo: gray | |
sdk: gradio | |
sdk_version: 4.44.1 | |
app_file: app.py | |
pinned: false | |
python_version: 3.11 | |
### Action items | |
- check new set of voices | |
- intonations | |
- add context | |
- audio effects | |
- add context | |
- filter, apply only for long phrases | |
- only for narrator? | |
- checkbox! make effects great again (no) optional | |
- limit on input text size | |
- fix UI | |
- slides / story | |
- optimizations | |
- combine sequential phrases of same character in single phrase | |
- support large texts. use batching. problem: how to ensure same characters? | |
- can detect characters in first prompt, then split text in each batch into character phrases | |
- probably split large phrases into smaller ones | |
- identify unknown characters | |
- use LLM to recognize characters for a given text and provide descriptions detailed enough to select appropriate voice | |