---
title: ai-audio-books
emoji: ππ¨βπ»π§
colorFrom: blue
colorTo: gray
sdk: gradio
sdk_version: 4.44.1
app_file: app.py
pinned: false
python_version: 3.11
---
### Action items
- voices
  - filter to use only best voices
- intonations
  - add context
- audio effects
  - add context
  - filter, apply only for long phrases
  - only for narrator?
- stability
  - add limit on input text size (5000 chars)
- improve UI
  - add error box
  - show character parts
- prepare slides / story
- testing
  - eval current execution time
- optimizations
  - combine sequential phrases of same character in single phrase
  - support large texts. use batching. problem: how to ensure same characters?
    - can detect characters in first prompt, then split text in each batch into character phrases
  - probably split large phrases into smaller ones
  - identify unknown characters
  - use LLM to recognize characters for a given text and provide descriptions detailed enough to select appropriate voice
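
A rough sketch of three of the items above (input size limit, combining sequential phrases of the same character, batching large texts). None of this is taken from `app.py`: the `Phrase` type, its field names, and all function names are hypothetical, and splitting batches on blank lines is an assumption about the input. Only the 5000-character limit comes from the list.

```python
from dataclasses import dataclass

MAX_INPUT_CHARS = 5000  # limit from the "stability" item


@dataclass
class Phrase:
    # Hypothetical container: who speaks, and what they say.
    character: str
    text: str


def check_input_size(text: str, limit: int = MAX_INPUT_CHARS) -> None:
    """Reject oversized input before any LLM or TTS call is made."""
    if len(text) > limit:
        raise ValueError(f"Input has {len(text)} characters; the limit is {limit}.")


def merge_sequential_phrases(phrases: list[Phrase]) -> list[Phrase]:
    """Combine consecutive phrases of the same character into one phrase."""
    merged: list[Phrase] = []
    for phrase in phrases:
        if merged and merged[-1].character == phrase.character:
            # Same speaker as the previous phrase: extend it.
            merged[-1] = Phrase(phrase.character, f"{merged[-1].text} {phrase.text}")
        else:
            merged.append(Phrase(phrase.character, phrase.text))
    return merged


def split_into_batches(text: str, limit: int = MAX_INPUT_CHARS) -> list[str]:
    """Greedily pack paragraphs (split on blank lines) into batches.

    A single paragraph longer than `limit` is kept as its own oversized
    batch; splitting large phrases further is left open, as in the list.
    """
    batches: list[str] = []
    current = ""
    for paragraph in text.split("\n\n"):
        candidate = f"{current}\n\n{paragraph}" if current else paragraph
        if len(candidate) <= limit or not current:
            current = candidate
        else:
            batches.append(current)
            current = paragraph
    if current:
        batches.append(current)
    return batches


example = [
    Phrase("narrator", "It was late."),
    Phrase("narrator", "Rain fell."),
    Phrase("Alice", "Hello?"),
]
print([p.text for p in merge_sequential_phrases(example)])
# prints ['It was late. Rain fell.', 'Hello?']
```

Merging before synthesis should reduce the number of TTS calls. Batching alone does not answer the open question of keeping characters consistent across batches; that would still need the character list detected in the first prompt to be passed along with each batch.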