Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper • 2412.13663 • Published 17 days ago • 116
All open-source models available on Workers AI Collection Read Developer Docs to get started https://developers.cloudflare.com/workers-ai/models/ • 38 items • Updated Apr 2, 2024 • 4
HF-curated models available on Workers AI Collection A collection of models curated with Hugging Face that can be run on Cloudflare's Workers AI serverless inference platform. • 15 items • Updated Apr 2, 2024 • 50
Zeroshot Classifiers Collection These are my current best zeroshot classifiers. Some of my older models are downloaded more often, but the models in this collection are newer/better. • 11 items • Updated 7 days ago • 117
Model Stock: All we need is just a few fine-tuned models Paper • 2403.19522 • Published Mar 28, 2024 • 10
LLM in a flash: Efficient Large Language Model Inference with Limited Memory Paper • 2312.11514 • Published Dec 12, 2023 • 257