Diving into Self-Evolving Training for Multimodal Reasoning • Paper • 2412.17451 • Published 9 days ago • 37
Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization • Paper • 2411.10442 • Published Nov 15, 2024 • 68
InternVL2.5-MPO • Collection • Enhancing the Reasoning Ability of MLLMs via Mixed Preference Optimization • 16 items • Updated about 23 hours ago • 23
Large Language Monkeys: Scaling Inference Compute with Repeated Sampling • Paper • 2407.21787 • Published Jul 31, 2024 • 12
Byte Latent Transformer: Patches Scale Better Than Tokens • Paper • 2412.09871 • Published 19 days ago • 79
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters • Paper • 2408.03314 • Published Aug 6, 2024 • 53