Vivim: a Video Vision Mamba for Medical Video Object Segmentation Paper • 2401.14168 • Published Jan 25, 2024 • 2
Qwen2-VL Collection Vision-language model series based on Qwen2 • 16 items • Updated Dec 6, 2024 • 188
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis Paper • 2403.03206 • Published Mar 5, 2024 • 60