JMMMU: A Japanese Massive Multi-discipline Multimodal Understanding Benchmark for Culture-aware Evaluation Paper โข 2410.17250 โข Published Oct 22, 2024 โข 14
Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages Paper โข 2410.16153 โข Published Oct 21, 2024 โข 44
Generalized Out-of-Distribution Detection and Beyond in Vision Language Model Era: A Survey Paper โข 2407.21794 โข Published Jul 31, 2024 โข 5
Generalized Out-of-Distribution Detection and Beyond in Vision Language Model Era: A Survey Paper โข 2407.21794 โข Published Jul 31, 2024 โข 5
Generalized Out-of-Distribution Detection and Beyond in Vision Language Model Era: A Survey Paper โข 2407.21794 โข Published Jul 31, 2024 โข 5 โข 2
view post Post 2686 A great vision language benchmark: MM-UPD evaluates how model responds to unsolvable problems ๐ค LLaVA 1.6 is outperforming proprietary VLMs, making it a very robust choice for production!It is now hosted as a leaderboard MM-UPD/MM-UPD_Leaderboard ๐๐ ๐ 8 8 + Reply
Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models Paper โข 2403.20331 โข Published Mar 29, 2024 โข 14
Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models Paper โข 2403.20331 โข Published Mar 29, 2024 โข 14