Data Engineering for Scaling Language Models to 128K Context Paper • 2402.10171 • Published Feb 15, 2024 • 23
StripedHyena Collection The collection of all hyena hybrids. • 4 items • Updated Feb 25, 2024 • 5
LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper • 2402.13753 • Published Feb 21, 2024 • 114
Tulu V2 Suite Collection The set of models associated with the paper "Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2" • 19 items • Updated Nov 27, 2024 • 43
🚀GGUF Collection Llama.cpp compatible models, can be used on CPUs and GPUs! • 987 items • Updated about 7 hours ago • 35
BLING Models Collection Small CPU-based RAG-optimized, instruct-following 1B-3B parameter models • 27 items • Updated Oct 28, 2024 • 26
Recent models: last 100 repos, sorted by creation date Collection The last 100 repos I have created. Sorted by creation date descending, so the most recently created repos appear at the top. • 121 items • Updated Jan 31, 2024 • 506