Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2407.14358

For Content Creator

Generative AI meets 3D: A Survey on Text-to-3D in AIGC Era

Paper • 2305.06131 • Published May 10, 2023 • 2
Perpetual Humanoid Control for Real-time Simulated Avatars

Paper • 2305.06456 • Published May 10, 2023 • 1
Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold

Paper • 2305.10973 • Published May 18, 2023 • 33
LDM3D: Latent Diffusion Model for 3D

Paper • 2305.10853 • Published May 18, 2023 • 10

generative audio

Taming Data and Transformers for Audio Generation

Paper • 2406.19388 • Published Jun 27, 2024
GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities

Paper • 2406.11768 • Published Jun 17, 2024 • 20
PicoAudio: Enabling Precise Timestamp and Frequency Controllability of Audio Events in Text-to-audio Generation

Paper • 2407.02869 • Published Jul 3, 2024 • 18
Stable Audio Open

Paper • 2407.14358 • Published Jul 19, 2024 • 25

Stable Audio Open

Paper • 2407.14358 • Published Jul 19, 2024 • 25

audio-model-use

stabilityai/stable-audio-open-1.0

Text-to-Audio • Updated Jul 31, 2024 • 23.6k • 1.03k
Stable Audio Open

Paper • 2407.14358 • Published Jul 19, 2024 • 25
facebook/musicgen-small

Text-to-Audio • Updated Nov 17, 2023 • 54.5k • • 388

stabilityai/stable-audio-open-1.0

Text-to-Audio • Updated Jul 31, 2024 • 23.6k • 1.03k
Stable Audio Open

Paper • 2407.14358 • Published Jul 19, 2024 • 25

Stable Audio Open

Paper • 2407.14358 • Published Jul 19, 2024 • 25
Qwen2-Audio Technical Report

Paper • 2407.10759 • Published Jul 15, 2024 • 56
kyutai/moshiko-pytorch-bf16

Updated Sep 18, 2024 • 126k • 156
Presto! Distilling Steps and Layers for Accelerating Music Generation

Paper • 2410.05167 • Published Oct 7, 2024 • 16

SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models

Paper • 2407.15841 • Published Jul 22, 2024 • 40
Stable Audio Open

Paper • 2407.14358 • Published Jul 19, 2024 • 25
PlacidDreamer: Advancing Harmony in Text-to-3D Generation

Paper • 2407.13976 • Published Jul 19, 2024 • 5
Efficient Audio Captioning with Encoder-Level Knowledge Distillation

Paper • 2407.14329 • Published Jul 19, 2024 • 5

Audio generation

Stable Audio Open

Paper • 2407.14358 • Published Jul 19, 2024 • 25

Stable Audio Open

Paper • 2407.14358 • Published Jul 19, 2024 • 25

Autoregressive Speech Synthesis without Vector Quantization

Paper • 2407.08551 • Published Jul 11, 2024 • 14
Stable Audio Open

Paper • 2407.14358 • Published Jul 19, 2024 • 25

Previous
1
2
Next

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs