The Unbearable Slowness of Being: Why do we live at 10 bits/s? Paper • 2408.10234 • Published Aug 3, 2024 • 1
Training Large Language Models to Reason in a Continuous Latent Space Paper • 2412.06769 • Published 27 days ago • 66
Discovering Preference Optimization Algorithms with and for Large Language Models Paper • 2406.08414 • Published Jun 12, 2024 • 14
Adding Error Bars to Evals: A Statistical Approach to Language Model Evaluations Paper • 2411.00640 • Published Nov 1, 2024 • 3
The Prompt Report: A Systematic Survey of Prompting Techniques Paper • 2406.06608 • Published Jun 6, 2024 • 56
Training language models to follow instructions with human feedback Paper • 2203.02155 • Published Mar 4, 2022 • 16
RULER: What's the Real Context Size of Your Long-Context Language Models? Paper • 2404.06654 • Published Apr 9, 2024 • 34
Cosmos Tokenizer Collection A suite of image and video tokenizers • 12 items • Updated about 10 hours ago • 29
LLM Reasoning Papers Collection Papers to improve reasoning capabilities of LLMs • 17 items • Updated 14 days ago • 93
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone Paper • 2404.14219 • Published Apr 22, 2024 • 253
Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping Paper • 2402.14083 • Published Feb 21, 2024 • 47