Llama 3.2 Collection Meta goes small with Llama 3.2: the text-only 1B and 3B models, and the 11B Vision models. • 15 items • Updated 20 days ago • 10
Both Text and Images Leaked! A Systematic Analysis of Multimodal LLM Data Contamination Paper • 2411.03823 • Published Nov 6, 2024 • 43
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level Paper • 2411.03562 • Published Nov 5, 2024 • 64
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 15 items • Updated 14 days ago • 197
LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding Paper • 2404.16710 • Published Apr 25, 2024 • 75
Josiefied and Abliterated Collection Abliterated and further fine-tuned to be the most uncensored models available. • 13 items • Updated 17 days ago • 4
Molmo Collection Artifacts for open multimodal language models. • 5 items • Updated Nov 27, 2024 • 291
Llama 3.2 Collection Meta's new Llama 3.2 vision and text models including 1B, 3B, 11B and 90B. Includes GGUF, 4-bit bnb and original versions. • 23 items • Updated 12 days ago • 46
Mamba Collection Mamba is a new LLM architecture that integrates the Structured State Space sequence model to manage lengthy data sequences. • 11 items • Updated Oct 12, 2024 • 1
Transformers compatible Mamba Collection This release includes the `mamba` repositories compatible with the `transformers` library • 5 items • Updated Mar 6, 2024 • 37