HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation Paper β’ 2412.21199 β’ Published 4 days ago β’ 9
Facilitating large language model Russian adaptation with Learned Embedding Propagation Paper β’ 2412.21140 β’ Published 4 days ago β’ 11
Scaling Test-Time Compute with Open Models Collection Models and datasets used in our blog post: https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute β’ 4 items β’ Updated 3 days ago β’ 18
The Big Benchmarks Collection Collection Gathering benchmark spaces on the hub (beyond the Open LLM Leaderboard) β’ 13 items β’ Updated Nov 18, 2024 β’ 183
Recommended small models Collection This is everything recent smaller than ~25B parameters that are high quality/reputable β’ 19 items β’ Updated Nov 30, 2024 β’ 33
Sana Collection β‘οΈSana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer β’ 17 items β’ Updated 15 days ago β’ 66
Triangulum Series Collection Reasoning, Long Chain of Thought, Keyword based problem solve β’ 7 items β’ Updated 3 days ago β’ 9
Deepthink and Reasoning Collection Best for Deepthink and Reasoning β’ 10 items β’ Updated 4 days ago β’ 9
Granite 3.1 Language Models Collection A series of language models with 128K context length trained by IBM licensed under Apache 2.0 license. β’ 8 items β’ Updated 17 days ago β’ 45
ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling β’ 3 items β’ Updated 15 days ago β’ 112
SBS Figures: Pre-training Figure QA from Stage-by-Stage Synthesized Images Paper β’ 2412.17606 β’ Published 12 days ago β’ 5
Efficiently Serving LLM Reasoning Programs with Certaindex Paper β’ 2412.20993 β’ Published 5 days ago β’ 28
Bringing Objects to Life: 4D generation from 3D objects Paper β’ 2412.20422 β’ Published 6 days ago β’ 32
On the Compositional Generalization of Multimodal LLMs for Medical Imaging Paper β’ 2412.20070 β’ Published 7 days ago β’ 39
Explanatory Instructions: Towards Unified Vision Tasks Understanding and Zero-shot Generalization Paper β’ 2412.18525 β’ Published 10 days ago β’ 59