-
UNIMO-G: Unified Image Generation through Multimodal Conditional Diffusion
Paper • 2401.13388 • Published • 11 -
BootPIG: Bootstrapping Zero-shot Personalized Image Generation Capabilities in Pretrained Diffusion Models
Paper • 2401.13974 • Published • 12 -
420🏃
Real ESRGAN
-
Vchitect/Vchitect-2.0-2B
Text-to-Video • Updated • 29 • 35
Collections including paper arxiv:2401.13388
-
CreativeSynth: Creative Blending and Synthesis of Visual Arts based on Multimodal Diffusion
Paper • 2401.14066 • Published • 8 -
UNIMO-G: Unified Image Generation through Multimodal Conditional Diffusion
Paper • 2401.13388 • Published • 11 -
Animated Stickers: Bringing Stickers to Life with Video Diffusion
Paper • 2402.06088 • Published • 9
-
Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image Synthesis
Paper • 2401.09048 • Published • 9 -
Improving fine-grained understanding in image-text pre-training
Paper • 2401.09865 • Published • 16 -
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data
Paper • 2401.10891 • Published • 60 -
Scaling Up to Excellence: Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild
Paper • 2401.13627 • Published • 73
-
The Chosen One: Consistent Characters in Text-to-Image Diffusion Models
Paper • 2311.10093 • Published • 56 -
NeuroPrompts: An Adaptive Framework to Optimize Prompts for Text-to-Image Generation
Paper • 2311.12229 • Published • 26 -
Diffusion Model Alignment Using Direct Preference Optimization
Paper • 2311.12908 • Published • 47 -
VMC: Video Motion Customization using Temporal Attention Adaption for Text-to-Video Diffusion Models
Paper • 2312.00845 • Published • 36
-
FreeU: Free Lunch in Diffusion U-Net
Paper • 2309.11497 • Published • 64 -
Imagic: Text-Based Real Image Editing with Diffusion Models
Paper • 2210.09276 • Published -
On Architectural Compression of Text-to-Image Diffusion Models
Paper • 2305.15798 • Published • 4 -
Wuerstchen: Efficient Pretraining of Text-to-Image Models
Paper • 2306.00637 • Published • 12