Multimodal Language Model
More advanced and challenging multi-task evaluation
VLMEvalKit Evaluation Results Collection