Rui Zhao's picture

Rui Zhao

ruizhaocv

·

https://ruizhaocv.github.io/

AI & ML interests

Multimodal and GenAI

Recent Activity

upvoted a paper 3 days ago

VideoMaker: Zero-shot Customized Video Generation with the Inherent Force of Video Diffusion Models

upvoted a paper 20 days ago

BrushEdit: All-In-One Image Inpainting and Editing

upvoted a paper 20 days ago

Wonderland: Navigating 3D Scenes from a Single Image

View all activity

Organizations

ruizhaocv's activity

upvoted a paper 3 days ago

VideoMaker: Zero-shot Customized Video Generation with the Inherent Force of Video Diffusion Models

Paper • 2412.19645 • Published 10 days ago • 13

upvoted 3 papers 20 days ago

BrushEdit: All-In-One Image Inpainting and Editing

Paper • 2412.10316 • Published 24 days ago • 33

Wonderland: Navigating 3D Scenes from a Single Image

Paper • 2412.12091 • Published 20 days ago • 15

ColorFlow: Retrieval-Augmented Image Sequence Colorization

Paper • 2412.11815 • Published 21 days ago • 26

upvoted 3 papers 21 days ago

InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption

Paper • 2412.09283 • Published 25 days ago • 19

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published 23 days ago • 136

GenEx: Generating an Explorable World

Paper • 2412.09624 • Published 24 days ago • 87

upvoted 3 papers 24 days ago

LoRACLR: Contrastive Adaptation for Customization of Diffusion Models

Paper • 2412.09622 • Published 24 days ago • 7

EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM

Paper • 2412.09618 • Published 24 days ago • 21

Multimodal Latent Language Modeling with Next-Token Diffusion

Paper • 2412.08635 • Published 25 days ago • 41

upvoted a paper 25 days ago

StyleMaster: Stylize Your Video with Artistic Generation and Translation

Paper • 2412.07744 • Published 27 days ago • 19

upvoted 7 papers 26 days ago

NVILA: Efficient Frontier Visual Language Models

Paper • 2412.04468 • Published Dec 5, 2024 • 57

MotionShop: Zero-Shot Motion Transfer in Video Diffusion Models with Mixture of Score Guidance

Paper • 2412.05355 • Published about 1 month ago • 7

Around the World in 80 Timesteps: A Generative Approach to Global Visual Geolocation

Paper • 2412.06781 • Published 27 days ago • 18

AMO Sampler: Enhancing Text Rendering with Overshooting

Paper • 2411.19415 • Published Nov 28, 2024 • 3

ObjCtrl-2.5D: Training-free Object Control with Camera Poses

Paper • 2412.07721 • Published 27 days ago • 8

DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation

Paper • 2412.07589 • Published 27 days ago • 46

UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics

Paper • 2412.07774 • Published 26 days ago • 25

upvoted a paper about 1 month ago

ROICtrl: Boosting Instance Control for Visual Generation

Paper • 2411.17949 • Published Nov 27, 2024 • 82

upvoted a paper about 2 months ago

ReCapture: Generative Video Camera Controls for User-Provided Videos using Masked Video Fine-Tuning

Paper • 2411.05003 • Published Nov 7, 2024 • 70