flow2023's Collections
generation-diffusion
High-Quality Image Restoration Following Human Instructions
Paper • 2401.16468 • Published • 12
Object-Driven One-Shot Fine-tuning of Text-to-Image Diffusion with Prototypical Embedding
Paper • 2401.15708 • Published • 11
Taiyi-Diffusion-XL: Advancing Bilingual Text-to-Image Generation with Large Vision-Language Model Support
Paper • 2401.14688 • Published • 13
TIP-Editor: An Accurate 3D Editor Following Both Text-Prompts And Image-Prompts
Paper • 2401.14828 • Published • 7
BootPIG: Bootstrapping Zero-shot Personalized Image Generation Capabilities in Pretrained Diffusion Models
Paper • 2401.13974 • Published • 12
Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs
Paper • 2401.11708 • Published • 30
DiffusionGPT: LLM-Driven Text-to-Image Generation System
Paper • 2401.10061 • Published • 29
WorldDreamer: Towards General World Models for Video Generation via Predicting Masked Tokens
Paper • 2401.09985 • Published • 15
Improving fine-grained understanding in image-text pre-training
Paper • 2401.09865 • Published • 16
StableDrag: Stable Dragging for Point-based Image Editing
Paper • 2403.04437 • Published • 25
AnyV2V: A Plug-and-Play Framework For Any Video-to-Video Editing Tasks
Paper • 2403.14468 • Published • 23
ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback
Paper • 2404.07987 • Published • 47
Customizing Text-to-Image Models with a Single Image Pair
Paper • 2405.01536 • Published • 18
Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation
Paper • 2406.06525 • Published • 66