Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2412.21187

2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining

Paper • 2501.00958 • Published 4 days ago • 75
Are Vision-Language Models Truly Understanding Multi-vision Sensor?

Paper • 2412.20750 • Published 7 days ago • 17
Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs

Paper • 2412.21187 • Published 6 days ago • 25
HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

Paper • 2412.18925 • Published 12 days ago • 86

interest_need_read

感兴趣热门论文集合

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published 28 days ago • 72
Maya: An Instruction Finetuned Multilingual Multimodal Model

Paper • 2412.07112 • Published 27 days ago • 25
OpenAI o1 System Card

Paper • 2412.16720 • Published 15 days ago • 29
Diving into Self-Evolving Training for Multimodal Reasoning

Paper • 2412.17451 • Published 14 days ago • 41

Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs

Paper • 2412.21187 • Published 6 days ago • 25
TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization

Paper • 2412.21037 • Published 7 days ago • 21

Code&Math&Reasoning

Evaluating and Aligning CodeLLMs on Human Preference

Paper • 2412.05210 • Published about 1 month ago • 47
B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners

Paper • 2412.17256 • Published 14 days ago • 44
Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs

Paper • 2412.21187 • Published 6 days ago • 25
HUNYUANPROVER: A Scalable Data Synthesis Framework and Guided Tree Search for Automated Theorem Proving

Paper • 2412.20735 • Published 7 days ago • 9

Meta-Learning a Dynamical Language Model

Paper • 1803.10631 • Published Mar 28, 2018
TLDR: Token Loss Dynamic Reweighting for Reducing Repetitive Utterance Generation

Paper • 2003.11963 • Published Mar 26, 2020
BigScience: A Case Study in the Social Construction of a Multilingual Large Language Model

Paper • 2212.04960 • Published Dec 9, 2022 • 1
Continuous Learning in a Hierarchical Multiscale Neural Network

Paper • 1805.05758 • Published May 15, 2018 • 1

On Memorization of Large Language Models in Logical Reasoning

Paper • 2410.23123 • Published Oct 30, 2024 • 18
LLMs Do Not Think Step-by-step In Implicit Reasoning

Paper • 2411.15862 • Published Nov 24, 2024 • 8
Training Large Language Models to Reason in a Continuous Latent Space

Paper • 2412.06769 • Published 27 days ago • 66
Deliberation in Latent Space via Differentiable Cache Augmentation

Paper • 2412.17747 • Published 13 days ago • 28

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs