MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale Paper • 2412.05237 • Published Dec 6, 2024 • 47
OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision Paper • 2411.07199 • Published Nov 11, 2024 • 46