CodeXEmbed: A Generalist Embedding Model Family for Multiligual and Multi-task Code Retrieval Paper • 2411.12644 • Published Nov 19, 2024 • 3
view article Article Python Is All You Need? Introducing Dria-Agent-α By andthattoo • 10 days ago • 22
KaLM-Embedding: Superior Training Data Brings A Stronger Embedding Model Paper • 2501.01028 • Published 19 days ago • 12
view article Article Synthetic Data Generation with FastData and Hugging Face By asoria • 13 days ago • 14
view article Article Fine-tune a SmolLM on domain-specific synthetic data from a LLM By davidberenstein1957 • 18 days ago • 31
Dolphin 3.0 Collection Dolphin 3.0 is the next generation of the Dolphin series of instruct-tuned models. Designed to be the ultimate general purpose local model. • 7 items • Updated 16 days ago • 56
view article Article 🐺🐦⬛ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark By wolfram • 18 days ago • 38
GLiNER Collection Knowledgator GLiNER models for information extraction • 8 items • Updated Dec 9, 2024 • 9
view article Article Fine-tune ModernBERT for text classification using synthetic data By davidberenstein1957 • 22 days ago • 23
YuLan-Mini Collection A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details. • 5 items • Updated 23 days ago • 11
Granite 3.1 Language Models Collection A series of language models with 128K context length trained by IBM licensed under Apache 2.0 license. • 8 items • Updated Dec 18, 2024 • 48
Smol but mighty Collection A collection of smoll but mighty models • 10 items • Updated Dec 19, 2024 • 4
LLaMat Collection Foundational Large Language Models for Materials Research • 6 items • Updated Dec 13, 2024 • 3