-
MMICL: Empowering Vision-language Model with Multi-Modal In-Context Learning
Paper • 2309.07915 • Published • 4 -
Skywork: A More Open Bilingual Foundation Model
Paper • 2310.19341 • Published • 6 -
Multimodal ChatGPT for Medical Applications: an Experimental Study of GPT-4V
Paper • 2310.19061 • Published • 8 -
Lumiere: A Space-Time Diffusion Model for Video Generation
Paper • 2401.12945 • Published • 85
Collections
Discover the best community collections!
Collections trending this week