Arctic-embed • Collection • A collection of text embedding models optimized for retrieval accuracy and efficiency • 8 items • Updated 29 days ago
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone • Paper • 2404.14219 • Published Apr 22, 2024
FP6-LLM: Efficiently Serving Large Language Models Through FP6-Centric Algorithm-System Co-Design • Paper • 2401.14112 • Published Jan 25, 2024