Spaces:
Sleeping
Sleeping
navalnica
commited on
Commit
Β·
9e4cbc3
1
Parent(s):
4eeec98
upd readme
Browse files
README.md
CHANGED
@@ -11,32 +11,22 @@ python_version: 3.11
|
|
11 |
---
|
12 |
|
13 |
### Action items
|
14 |
-
-
|
15 |
-
-
|
16 |
-
-
|
17 |
-
-
|
18 |
-
-
|
19 |
-
-
|
20 |
-
-
|
21 |
-
-
|
22 |
-
-
|
23 |
-
-
|
|
|
24 |
- optimizations
|
25 |
-
-
|
26 |
-
-
|
27 |
-
can detect characters in first prompt, then split text in each batch into character phrases
|
28 |
-
-
|
29 |
-
|
30 |
-
|
31 |
-
- [ ] prepare text for TTS
|
32 |
-
- [x] prepare prompt to split text into character phrases
|
33 |
-
- [ ] split large text in batches, process each batch separatelly, concat batches
|
34 |
-
- [ ] try to identify unknown characters
|
35 |
-
- [ ] select voices for TTS
|
36 |
-
- [ ] map characters to available voices
|
37 |
-
- [ ] use LLM to recognize characters for a given text and provide descriptions
|
38 |
-
detailed enough to select appropriate voice
|
39 |
-
- [ ] preprocess text phrases for TTS: add intonation markers, auto-set TTS params
|
40 |
-
- [ ] run TTS to create narration
|
41 |
-
- [ ] add effects. mix them with created narration
|
42 |
|
|
|
11 |
---
|
12 |
|
13 |
### Action items
|
14 |
+
- check new set of voices
|
15 |
+
- intonations
|
16 |
+
- add context
|
17 |
+
- audio effects
|
18 |
+
- add context
|
19 |
+
- filter, apply only for long phrases
|
20 |
+
- only for narrator?
|
21 |
+
- checkbox! make effects great again (no) optional
|
22 |
+
- limit on input text size
|
23 |
+
- fix UI
|
24 |
+
- slides / story
|
25 |
- optimizations
|
26 |
+
- combine sequential phrases of same character in single phrase
|
27 |
+
- support large texts. use batching. problem: how to ensure same characters?
|
28 |
+
- can detect characters in first prompt, then split text in each batch into character phrases
|
29 |
+
- probably split large phrases into smaller ones
|
30 |
+
- identify unknown characters
|
31 |
+
- use LLM to recognize characters for a given text and provide descriptions detailed enough to select appropriate voice
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
32 |
|