MiniMax-01: Scaling Foundation Models with Lightning Attention Paper โข 2501.08313 โข Published 7 days ago โข 263
DeepSeek R1 (All Versions) Collection DeepSeek R1 - the most powerful reasoning open-source model - available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. โข 26 items โข Updated 23 minutes ago โข 30
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B Text Generation โข Updated about 6 hours ago โข 40.3k โข 222
๐ FineMath Collection FineMath datasets and ablation models โข 14 items โข Updated 15 days ago โข 17