MMVU: Measuring Expert-Level Multi-Discipline Video Understanding Paper • 2501.12380 • Published about 23 hours ago • 47
freesky/InternVL-Chat-V1-5_ft_by_DecoVQAplus_SelectiveLoss Visual Question Answering • Updated Oct 6, 2024 • 4
Large Language Models are Effective Table-to-Text Generators, Evaluators, and Feedback Providers Paper • 2305.14987 • Published May 24, 2023 • 1
Visual Question Decomposition on Multimodal Large Language Models Paper • 2409.19339 • Published Sep 28, 2024 • 8
Qwen2 Collection Qwen2 language models, including pretrained and instruction-tuned models in 5 sizes: 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated Nov 28, 2024 • 355