ProcessBench: Identifying Process Errors in Mathematical Reasoning Paper • 2412.06559 • Published 26 days ago • 72
PepTune: De Novo Generation of Therapeutic Peptides with Multi-Objective-Guided Discrete Diffusion Paper • 2412.17780 • Published 11 days ago • 3
Bridging the Data Provenance Gap Across Text, Speech and Video Paper • 2412.17847 • Published 16 days ago • 7
Falcon3 Collection Falcon3 family of Open Foundation Models is a set of pretrained and instruct LLMs ranging from 1B to 10B parameters. • 40 items • Updated 16 days ago • 75
view article Article Comparing Open-source and Proprietary LLMs in Medical AI By mpimentel • Oct 3, 2024 • 16
Surveying the Effects of Quality, Diversity, and Complexity in Synthetic Data From Large Language Models Paper • 2412.02980 • Published Dec 4, 2024 • 12
Insight-V Collection Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models • 5 items • Updated Nov 22, 2024 • 9
MobileLLM Collection Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 9 items • Updated Nov 27, 2024 • 101
Biomedical Collection Models for biomedical research applications, such as radiology report generation and biomedical language understanding. • 9 items • Updated Nov 1, 2024 • 6
FrontierMath: A Benchmark for Evaluating Advanced Mathematical Reasoning in AI Paper • 2411.04872 • Published Nov 7, 2024 • 4
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 15 items • Updated 12 days ago • 197
LLM Reasoning Papers Collection Papers to improve reasoning capabilities of LLMs • 17 items • Updated 12 days ago • 93
inftyBench: Extending Long Context Evaluation Beyond 100K Tokens Paper • 2402.13718 • Published Feb 21, 2024 • 1
JPEG-LM: LLMs as Image Generators with Canonical Codec Representations Paper • 2408.08459 • Published Aug 15, 2024 • 45