-
Gemini: A Family of Highly Capable Multimodal Models
Paper • 2312.11805 • Published • 45 -
Unlocking Pre-trained Image Backbones for Semantic Image Synthesis
Paper • 2312.13314 • Published • 8 -
LLM in a flash: Efficient Large Language Model Inference with Limited Memory
Paper • 2312.11514 • Published • 259 -
Amphion: An Open-Source Audio, Music and Speech Generation Toolkit
Paper • 2312.09911 • Published • 54
Collections
Discover the best community collections!
Collections including paper arxiv:2412.09626