Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs Paper • 2412.21187 • Published 6 days ago • 25
ProgCo: Program Helps Self-Correction of Large Language Models Paper • 2501.01264 • Published 4 days ago • 22
MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval Paper • 2412.14475 • Published 18 days ago • 52
Diving into Self-Evolving Training for Multimodal Reasoning Paper • 2412.17451 • Published 14 days ago • 41
B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners Paper • 2412.17256 • Published 14 days ago • 44
Progressive Multimodal Reasoning via Active Retrieval Paper • 2412.14835 • Published 18 days ago • 69
CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing Paper • 2305.11738 • Published May 19, 2023 • 8
UI Agent Collection a collection of algorithmic agents for user interfaces/interactions and program synthesis • 236 items • Updated 2 days ago • 38
Byte Latent Transformer: Patches Scale Better Than Tokens Paper • 2412.09871 • Published 24 days ago • 81
OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain Paper • 2412.13018 • Published 20 days ago • 41
Multi-Dimensional Insights: Benchmarking Real-World Personalization in Large Multimodal Models Paper • 2412.12606 • Published 20 days ago • 41
Smaller Language Models Are Better Instruction Evolvers Paper • 2412.11231 • Published 22 days ago • 26
RetroLLM: Empowering Large Language Models to Retrieve Fine-grained Evidence within Generation Paper • 2412.11919 • Published 21 days ago • 33
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions Paper • 2412.09596 • Published 24 days ago • 92