To See is to Believe: Prompting GPT-4V for Better Visual Instruction Tuning Paper • 2311.07574 • Published Nov 13, 2023 • 15
CSGO: Content-Style Composition in Text-to-Image Generation Paper • 2408.16766 • Published Aug 29, 2024 • 18
CogVLM2: Visual Language Models for Image and Video Understanding Paper • 2408.16500 • Published Aug 29, 2024 • 57