MiniMax-01: Scaling Foundation Models with Lightning Attention Paper • 2501.08313 • Published 5 days ago • 259
Running 143 Whisper Large V3 Turbo WebGPU: ML-powered speech recognition directly in your browser
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning Paper • 2409.12183 • Published Sep 18, 2024 • 37
Let Me Speak Freely? A Study on the Impact of Format Restrictions on Performance of Large Language Models Paper • 2408.02442 • Published Aug 5, 2024 • 21
Article Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging By akjindal53244 • Aug 19, 2024 • 76
Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers Paper • 2408.06195 • Published Aug 12, 2024 • 69
Running 242 Infinite Dataset Hub: Search and save datasets generated with a LLM in real time
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models Paper • 2405.01535 • Published May 2, 2024 • 121