-
Large Language Model Alignment: A Survey
Paper ā¢ 2309.15025 ā¢ Published ā¢ 2 -
Aligning Large Language Models with Human: A Survey
Paper ā¢ 2307.12966 ā¢ Published ā¢ 1 -
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
Paper ā¢ 2305.18290 ā¢ Published ā¢ 50 -
SteerLM: Attribute Conditioned SFT as an (User-Steerable) Alternative to RLHF
Paper ā¢ 2310.05344 ā¢ Published ā¢ 1
Collections
Discover the best community collections!
Collections including paper arxiv:2201.11903
-
Lost in the Middle: How Language Models Use Long Contexts
Paper ā¢ 2307.03172 ā¢ Published ā¢ 37 -
Efficient Estimation of Word Representations in Vector Space
Paper ā¢ 1301.3781 ā¢ Published ā¢ 6 -
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Paper ā¢ 1810.04805 ā¢ Published ā¢ 16 -
Attention Is All You Need
Paper ā¢ 1706.03762 ā¢ Published ā¢ 50
-
TinyLlama: An Open-Source Small Language Model
Paper ā¢ 2401.02385 ā¢ Published ā¢ 90 -
MM-LLMs: Recent Advances in MultiModal Large Language Models
Paper ā¢ 2401.13601 ā¢ Published ā¢ 45 -
SliceGPT: Compress Large Language Models by Deleting Rows and Columns
Paper ā¢ 2401.15024 ā¢ Published ā¢ 69 -
Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling
Paper ā¢ 2401.16380 ā¢ Published ā¢ 48
-
Attention Is All You Need
Paper ā¢ 1706.03762 ā¢ Published ā¢ 50 -
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Paper ā¢ 1810.04805 ā¢ Published ā¢ 16 -
RoBERTa: A Robustly Optimized BERT Pretraining Approach
Paper ā¢ 1907.11692 ā¢ Published ā¢ 7 -
DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter
Paper ā¢ 1910.01108 ā¢ Published ā¢ 14
-
Attention Is All You Need
Paper ā¢ 1706.03762 ā¢ Published ā¢ 50 -
Language Models are Few-Shot Learners
Paper ā¢ 2005.14165 ā¢ Published ā¢ 12 -
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Paper ā¢ 2201.11903 ā¢ Published ā¢ 9 -
Orca 2: Teaching Small Language Models How to Reason
Paper ā¢ 2311.11045 ā¢ Published ā¢ 71
-
RA-DIT: Retrieval-Augmented Dual Instruction Tuning
Paper ā¢ 2310.01352 ā¢ Published ā¢ 7 -
Self-Consistency Improves Chain of Thought Reasoning in Language Models
Paper ā¢ 2203.11171 ā¢ Published ā¢ 3 -
MemGPT: Towards LLMs as Operating Systems
Paper ā¢ 2310.08560 ā¢ Published ā¢ 7 -
Take a Step Back: Evoking Reasoning via Abstraction in Large Language Models
Paper ā¢ 2310.06117 ā¢ Published ā¢ 3
-
Technical Report: Large Language Models can Strategically Deceive their Users when Put Under Pressure
Paper ā¢ 2311.07590 ā¢ Published ā¢ 16 -
A Survey on Language Models for Code
Paper ā¢ 2311.07989 ā¢ Published ā¢ 21 -
Llamas Know What GPTs Don't Show: Surrogate Models for Confidence Estimation
Paper ā¢ 2311.08877 ā¢ Published ā¢ 6 -
A Challenger to GPT-4V? Early Explorations of Gemini in Visual Expertise
Paper ā¢ 2312.12436 ā¢ Published ā¢ 13
-
Contrastive Chain-of-Thought Prompting
Paper ā¢ 2311.09277 ā¢ Published ā¢ 34 -
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Paper ā¢ 2201.11903 ā¢ Published ā¢ 9 -
Orca 2: Teaching Small Language Models How to Reason
Paper ā¢ 2311.11045 ā¢ Published ā¢ 71 -
System 2 Attention (is something you might need too)
Paper ā¢ 2311.11829 ā¢ Published ā¢ 39
-
Understanding LLMs: A Comprehensive Overview from Training to Inference
Paper ā¢ 2401.02038 ā¢ Published ā¢ 62 -
Learning To Teach Large Language Models Logical Reasoning
Paper ā¢ 2310.09158 ā¢ Published ā¢ 1 -
ChipNeMo: Domain-Adapted LLMs for Chip Design
Paper ā¢ 2311.00176 ā¢ Published ā¢ 8 -
WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct
Paper ā¢ 2308.09583 ā¢ Published ā¢ 7
-
Retentive Network: A Successor to Transformer for Large Language Models
Paper ā¢ 2307.08621 ā¢ Published ā¢ 170 -
Sparks of Artificial General Intelligence: Early experiments with GPT-4
Paper ā¢ 2303.12712 ā¢ Published ā¢ 2 -
GPT-4 Technical Report
Paper ā¢ 2303.08774 ā¢ Published ā¢ 5 -
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Paper ā¢ 2201.11903 ā¢ Published ā¢ 9