Aira Collection Aira is a series of chatbots developed as an experimentation playground for value alignment. • 27 items • Updated Jun 20, 2024 • 1
Loxa Collection a Loxa family models are best models to running on CPU and GPU with high quality(=>92% accuracy) • 4 items • Updated 5 days ago • 2
Quadrifoglio 🍀 Collection Small text2text models finetuned on Italian machine translation tasks. • 6 items • Updated 9 days ago • 1
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper • 2412.13663 • Published Dec 18, 2024 • 125
RedPajama: an Open Dataset for Training Large Language Models Paper • 2411.12372 • Published Nov 19, 2024 • 48
FluidML: Fast and Memory Efficient Inference Optimization Paper • 2411.09242 • Published Nov 14, 2024 • 1
TÜLU 3: Pushing Frontiers in Open Language Model Post-Training Paper • 2411.15124 • Published Nov 22, 2024 • 58
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 15 items • Updated 29 days ago • 204
Llama 3.2 Collection This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 562
DataGemma Release Collection A series of pioneering open models that help ground LLMs in real-world data through Data Commons. • 2 items • Updated Dec 13, 2024 • 82
Trained Models 🏋️ Collection They may be small, but they're training like giants! • 8 items • Updated Dec 3, 2024 • 17
Minerva LLMs Collection The first family of LLMs pretrained from scratch on Italian. • 6 items • Updated Dec 7, 2024 • 33