Collections
Discover the best community collections!
Collections including paper arxiv:2309.05463
-
Textbooks Are All You Need
Paper β’ 2306.11644 β’ Published β’ 143 -
Textbooks Are All You Need II: phi-1.5 technical report
Paper β’ 2309.05463 β’ Published β’ 87 -
TinyStories: How Small Can Language Models Be and Still Speak Coherent English?
Paper β’ 2305.07759 β’ Published β’ 33 -
Scaling Synthetic Data Creation with 1,000,000,000 Personas
Paper β’ 2406.20094 β’ Published β’ 98
-
Attention Is All You Need
Paper β’ 1706.03762 β’ Published β’ 50 -
Language Models are Few-Shot Learners
Paper β’ 2005.14165 β’ Published β’ 12 -
GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints
Paper β’ 2305.13245 β’ Published β’ 5 -
Llama 2: Open Foundation and Fine-Tuned Chat Models
Paper β’ 2307.09288 β’ Published β’ 243
-
Textbooks Are All You Need II: phi-1.5 technical report
Paper β’ 2309.05463 β’ Published β’ 87 -
Neurons in Large Language Models: Dead, N-gram, Positional
Paper β’ 2309.04827 β’ Published β’ 16 -
Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking
Paper β’ 2403.09629 β’ Published β’ 76
-
Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models
Paper β’ 2402.13064 β’ Published β’ 48 -
Textbooks Are All You Need II: phi-1.5 technical report
Paper β’ 2309.05463 β’ Published β’ 87 -
DataDreamer: A Tool for Synthetic Data Generation and Reproducible LLM Workflows
Paper β’ 2402.10379 β’ Published β’ 31 -
Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models
Paper β’ 2312.06585 β’ Published β’ 28
-
A Survey on Language Models for Code
Paper β’ 2311.07989 β’ Published β’ 21 -
SWE-bench: Can Language Models Resolve Real-World GitHub Issues?
Paper β’ 2310.06770 β’ Published β’ 4 -
CRUXEval: A Benchmark for Code Reasoning, Understanding and Execution
Paper β’ 2401.03065 β’ Published β’ 11 -
Copilot Evaluation Harness: Evaluating LLM-Guided Software Programming
Paper β’ 2402.14261 β’ Published β’ 10
-
Visual In-Context Prompting
Paper β’ 2311.13601 β’ Published β’ 16 -
Textbooks Are All You Need
Paper β’ 2306.11644 β’ Published β’ 143 -
AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation Framework
Paper β’ 2308.08155 β’ Published β’ 5 -
LIDA: A Tool for Automatic Generation of Grammar-Agnostic Visualizations and Infographics using Large Language Models
Paper β’ 2303.02927 β’ Published β’ 3
-
Textbooks Are All You Need
Paper β’ 2306.11644 β’ Published β’ 143 -
LLaVA-Ο: Efficient Multi-Modal Assistant with Small Language Model
Paper β’ 2401.02330 β’ Published β’ 14 -
Textbooks Are All You Need II: phi-1.5 technical report
Paper β’ 2309.05463 β’ Published β’ 87 -
Visual Instruction Tuning
Paper β’ 2304.08485 β’ Published β’ 13
-
INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning
Paper β’ 2401.06532 β’ Published β’ 12 -
Textbooks Are All You Need II: phi-1.5 technical report
Paper β’ 2309.05463 β’ Published β’ 87 -
RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback
Paper β’ 2309.00267 β’ Published β’ 47 -
What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning
Paper β’ 2312.15685 β’ Published β’ 16