OpenCoder Collection OpenCoder is an open and reproducible code LLM family which matches the performance of top-tier code LLMs. • 8 items • Updated Nov 23, 2024 • 79
To Code, or Not To Code? Exploring Impact of Code in Pre-training Paper • 2408.10914 • Published Aug 20, 2024 • 41
OLMo Suite Collection Artifacts for the first set of OLMo models. • 18 items • Updated about 22 hours ago • 70
Nemotron 4 340B Collection Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. • 4 items • Updated about 14 hours ago • 161
The Prompt Report: A Systematic Survey of Prompting Techniques Paper • 2406.06608 • Published Jun 6, 2024 • 58
PaliGemma Release Collection Pretrained and mix checkpoints for PaliGemma • 16 items • Updated 25 days ago • 143
Granite Code Models Collection A series of code models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 23 items • Updated 20 days ago • 181
Releasing Swift Transformers: Run On-Device LLMs in Apple Devices Article • Aug 8, 2023 • 28
StarCoder 2 and The Stack v2: The Next Generation Paper • 2402.19173 • Published Feb 29, 2024 • 136
MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training Paper • 2403.09611 • Published Mar 14, 2024 • 125
Flamingo: a Visual Language Model for Few-Shot Learning Paper • 2204.14198 • Published Apr 29, 2022 • 14
Gemma release Collection Groups the Gemma models released by the Google team. • 40 items • Updated 25 days ago • 328
OS-Copilot: Towards Generalist Computer Agents with Self-Improvement Paper • 2402.07456 • Published Feb 12, 2024 • 42
LoRA: Low-Rank Adaptation of Large Language Models Paper • 2106.09685 • Published Jun 17, 2021 • 31
Specialized Language Models with Cheap Inference from Limited Domain Data Paper • 2402.01093 • Published Feb 2, 2024 • 45
Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling Paper • 2401.16380 • Published Jan 29, 2024 • 48
AssistGPT: A General Multi-modal Assistant that can Plan, Execute, Inspect, and Learn Paper • 2306.08640 • Published Jun 14, 2023 • 26