Salesforce/xgen-mm-phi3-mini-instruct-dpo-r-v1.5 • Image-Text-to-Text • Updated Sep 16, 2024 • 110 • 17
HuggingFaceM4/siglip-so400m-14-980-flash-attn2-navit • Zero-Shot Image Classification • Updated Mar 7, 2024 • 7.85k • 43
ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding • Paper • 2501.05452 • Published 12 days ago • 14
The GAN is dead; long live the GAN! A Modern GAN Baseline • Paper • 2501.05441 • Published 12 days ago • 80
timm/vit_base_patch16_clip_224.openai • Image Feature Extraction • Updated Oct 23, 2024 • 225k • 6
VideoAnydoor: High-fidelity Video Object Insertion with Precise Motion Control • Paper • 2501.01427 • Published 19 days ago • 49