matlok
's Collections
Papers - Audio - TTS
updated
Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram
Predictions
Paper
•
1712.05884
•
Published
•
2
VoiceCraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild
Paper
•
2403.16973
•
Published
•
2
High Fidelity Neural Audio Compression
Paper
•
2210.13438
•
Published
•
4
RALL-E: Robust Codec Language Modeling with Chain-of-Thought Prompting
for Text-to-Speech Synthesis
Paper
•
2404.03204
•
Published
•
7
Qwen-Audio: Advancing Universal Audio Understanding via Unified
Large-Scale Audio-Language Models
Paper
•
2311.07919
•
Published
•
9
suno/bark
Text-to-Speech
•
Updated
•
75k
•
•
1.21k
Natural language guidance of high-fidelity text-to-speech with synthetic
annotations
Paper
•
2402.01912
•
Published
•
11
F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow
Matching
Paper
•
2410.06885
•
Published
•
43
Matcha-TTS: A fast TTS architecture with conditional flow matching
Paper
•
2309.03199
•
Published
•
11
SONAR: Sentence-Level Multimodal and Language-Agnostic Representations
Paper
•
2308.11466
•
Published
•
1