navalnica commited on
Commit
9e4cbc3
Β·
1 Parent(s): 4eeec98

upd readme

Browse files
Files changed (1) hide show
  1. README.md +17 -27
README.md CHANGED
@@ -11,32 +11,22 @@ python_version: 3.11
11
  ---
12
 
13
  ### Action items
14
- - [ ] move speaker split to new pipeline
15
- - [ ] env template
16
- - [ ] move from AI/ML api to langchain
17
- - [ ] bugfix w/ 11labs api
18
- - [ ] async synthesis
19
- - [ ] map characters to voices
20
- - [] emotion enrichment: add intonation markers, auto-set TTS params
21
- - [x] generate good enough sound effects for background
22
- - [ ] mix effects with narrration
23
- - [x] allow files uplaod (.txt)
 
24
  - optimizations
25
- - [ ] combine sequential phrases of same character in single phrase
26
- - [ ] support large texts. use batching. problem: how to ensure same characters?
27
- can detect characters in first prompt, then split text in each batch into character phrases
28
- - [ ] probably split large phrases into smaller ones
29
-
30
- ### Backlog
31
- - [ ] prepare text for TTS
32
- - [x] prepare prompt to split text into character phrases
33
- - [ ] split large text in batches, process each batch separatelly, concat batches
34
- - [ ] try to identify unknown characters
35
- - [ ] select voices for TTS
36
- - [ ] map characters to available voices
37
- - [ ] use LLM to recognize characters for a given text and provide descriptions
38
- detailed enough to select appropriate voice
39
- - [ ] preprocess text phrases for TTS: add intonation markers, auto-set TTS params
40
- - [ ] run TTS to create narration
41
- - [ ] add effects. mix them with created narration
42
 
 
11
  ---
12
 
13
  ### Action items
14
+ - check new set of voices
15
+ - intonations
16
+ - add context
17
+ - audio effects
18
+ - add context
19
+ - filter, apply only for long phrases
20
+ - only for narrator?
21
+ - checkbox! make effects great again (no) optional
22
+ - limit on input text size
23
+ - fix UI
24
+ - slides / story
25
  - optimizations
26
+ - combine sequential phrases of same character in single phrase
27
+ - support large texts. use batching. problem: how to ensure same characters?
28
+ - can detect characters in first prompt, then split text in each batch into character phrases
29
+ - probably split large phrases into smaller ones
30
+ - identify unknown characters
31
+ - use LLM to recognize characters for a given text and provide descriptions detailed enough to select appropriate voice
 
 
 
 
 
 
 
 
 
 
 
32