Parler-TTS: Expresso ☕️ Collection Parler-TTS v0.1 fine-tuned on the Expresso dataset, for expressive, voice-consistent generations. • 3 items • Updated Aug 7, 2024 • 6
Vision-Language Modeling Collection Our datasets and models for Visual-Language Modeling • 5 items • Updated Nov 25, 2024 • 6
METR: Image Watermarking with Large Number of Unique Messages Paper • 2408.08340 • Published Aug 15, 2024 • 5
JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models Paper • 2308.04729 • Published Aug 9, 2023 • 31
Efficient Quantization Strategies for Latent Diffusion Models Paper • 2312.05431 • Published Dec 9, 2023 • 11
Photorealistic Video Generation with Diffusion Models Paper • 2312.06662 • Published Dec 11, 2023 • 23
The ArtBench Dataset: Benchmarking Generative Models with Artworks Paper • 2206.11404 • Published Jun 22, 2022 • 2