VideoMaker: Zero-shot Customized Video Generation with the Inherent Force of Video Diffusion Models Paper • 2412.19645 • Published 10 days ago • 13
ColorFlow: Retrieval-Augmented Image Sequence Colorization Paper • 2412.11815 • Published 21 days ago • 26
InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption Paper • 2412.09283 • Published 25 days ago • 19
Apollo: An Exploration of Video Understanding in Large Multimodal Models Paper • 2412.10360 • Published 23 days ago • 136
LoRACLR: Contrastive Adaptation for Customization of Diffusion Models Paper • 2412.09622 • Published 24 days ago • 7
EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM Paper • 2412.09618 • Published 24 days ago • 21
Multimodal Latent Language Modeling with Next-Token Diffusion Paper • 2412.08635 • Published 25 days ago • 41
StyleMaster: Stylize Your Video with Artistic Generation and Translation Paper • 2412.07744 • Published 27 days ago • 19
MotionShop: Zero-Shot Motion Transfer in Video Diffusion Models with Mixture of Score Guidance Paper • 2412.05355 • Published about 1 month ago • 7
Around the World in 80 Timesteps: A Generative Approach to Global Visual Geolocation Paper • 2412.06781 • Published 27 days ago • 18
AMO Sampler: Enhancing Text Rendering with Overshooting Paper • 2411.19415 • Published Nov 28, 2024 • 3
ObjCtrl-2.5D: Training-free Object Control with Camera Poses Paper • 2412.07721 • Published 27 days ago • 8
DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation Paper • 2412.07589 • Published 27 days ago • 46
UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics Paper • 2412.07774 • Published 26 days ago • 25
ROICtrl: Boosting Instance Control for Visual Generation Paper • 2411.17949 • Published Nov 27, 2024 • 82
ReCapture: Generative Video Camera Controls for User-Provided Videos using Masked Video Fine-Tuning Paper • 2411.05003 • Published Nov 7, 2024 • 70