- SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling
  Paper • 2312.15166 • Published • 56
- PowerInfer: Fast Large Language Model Serving with a Consumer-grade GPU
  Paper • 2312.12456 • Published • 40
- Cached Transformers: Improving Transformers with Differentiable Memory Cache
  Paper • 2312.12742 • Published • 12
- Mini-GPTs: Efficient Large Language Models through Contextual Pruning
  Paper • 2312.12682 • Published • 8
Collections including paper arxiv:2309.11235
- OpenChat: Advancing Open-source Language Models with Mixed-Quality Data
  Paper • 2309.11235 • Published • 16
- Orca 2: Teaching Small Language Models How to Reason
  Paper • 2311.11045 • Published • 71
- MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models
  Paper • 2309.12284 • Published • 19

- S-LoRA: Serving Thousands of Concurrent LoRA Adapters
  Paper • 2311.03285 • Published • 28
- Tailoring Self-Rationalizers with Multi-Reward Distillation
  Paper • 2311.02805 • Published • 3
- Ultra-Long Sequence Distributed Transformer
  Paper • 2311.02382 • Published • 2
- OpenChat: Advancing Open-source Language Models with Mixed-Quality Data
  Paper • 2309.11235 • Published • 16

- Understanding LLMs: A Comprehensive Overview from Training to Inference
  Paper • 2401.02038 • Published • 62
- Learning To Teach Large Language Models Logical Reasoning
  Paper • 2310.09158 • Published • 1
- ChipNeMo: Domain-Adapted LLMs for Chip Design
  Paper • 2311.00176 • Published • 8
- WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct
  Paper • 2308.09583 • Published • 7

- Ensemble-Instruct: Generating Instruction-Tuning Data with a Heterogeneous Mixture of LMs
  Paper • 2310.13961 • Published • 4
- Fabricator: An Open Source Toolkit for Generating Labeled Training Data with Teacher LLMs
  Paper • 2309.09582 • Published • 4
- Auto-Instruct: Automatic Instruction Generation and Ranking for Black-Box Language Models
  Paper • 2310.13127 • Published • 11
- Evaluating the Robustness to Instructions of Large Language Models
  Paper • 2308.14306 • Published • 1

- Moral Foundations of Large Language Models
  Paper • 2310.15337 • Published • 1
- Specific versus General Principles for Constitutional AI
  Paper • 2310.13798 • Published • 2
- Contrastive Prefence Learning: Learning from Human Feedback without RL
  Paper • 2310.13639 • Published • 24
- RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback
  Paper • 2309.00267 • Published • 47

- TheBloke/Llama-2-7B-Chat-GGML
  Text Generation • Updated • 2.72k • 865
- uonlp/CulturaX
  Viewer • Updated • 7.18B • 4.52k • 491
- OpenChat: Advancing Open-source Language Models with Mixed-Quality Data
  Paper • 2309.11235 • Published • 16
- Self-Instruct: Aligning Language Model with Self Generated Instructions
  Paper • 2212.10560 • Published • 9

- OpenChat: Advancing Open-source Language Models with Mixed-Quality Data
  Paper • 2309.11235 • Published • 16
- openchat/openchat-3.5-0106
  Text Generation • Updated • 24.4k • 350
- openchat/openchat-3.5-1210
  Text Generation • Updated • 4.24k • 276
- openchat/openchat_3.5
  Text Generation • Updated • 25.5k • 1.12k