Article LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs By wolfram • about 1 month ago • 75
Article Bridging the Gap Between Physical Numerical Simulations and Machine Learning: Introducing The Well By rubenohana • Dec 2, 2024 • 17
Video models Collection text-to-video & image-to-video models released by the Chinese community • 22 items • Updated 11 days ago • 4
Llama-3.1-Nemotron-70B Collection SOTA models on Arena Hard and RewardBench as of 1 Oct 2024. • 6 items • Updated 32 minutes ago • 149
Show-o: One Single Transformer to Unify Multimodal Understanding and Generation Paper • 2408.12528 • Published Aug 22, 2024 • 50
Jamba-1.5 Collection The AI21 Jamba family of models are state-of-the-art, hybrid SSM-Transformer instruction-following foundation models • 2 items • Updated Aug 22, 2024 • 83
Article Llama 3.1 - 405B, 70B & 8B with multilinguality and long context Jul 23, 2024 • 225
VITA: Towards Open-Source Interactive Omni Multimodal LLM Paper • 2408.05211 • Published Aug 9, 2024 • 47
Llama-3.1 Quantization Collection Neural Magic quantized Llama-3.1 models • 22 items • Updated Nov 22, 2024 • 42
Article Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth By mlabonne • Jul 29, 2024 • 260
Article ZebraLogic: Benchmarking the Logical Reasoning Ability of Language Models By yuchenlin • Jul 27, 2024 • 27
Llama 3.1 Collection This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3, and Prompt Guard models • 11 items • Updated 29 days ago • 637
Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs Paper • 2406.16860 • Published Jun 24, 2024 • 59
SSMs Collection A collection of Mamba-2-based research models with 8B parameters trained on 3.5T tokens for comparison with Transformers. • 5 items • Updated 32 minutes ago • 26
Nemotron 4 340B Collection Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. • 4 items • Updated 32 minutes ago • 161