- From Quantity to Quality: Boosting LLM Performance with Self-Guided Data Selection for Instruction Tuning
  Paper • 2308.12032 • Published • 1
- Know thy corpus! Robust methods for digital curation of Web corpora
  Paper • 2003.06389 • Published • 1
- Self-Alignment with Instruction Backtranslation
  Paper • 2308.06259 • Published • 41
- The Vault: A Comprehensive Multilingual Dataset for Advancing Code Understanding and Generation
  Paper • 2305.06156 • Published • 2
Collections including paper arxiv:2402.03300
- Attention Is All You Need
  Paper • 1706.03762 • Published • 50
- FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning
  Paper • 2307.08691 • Published • 8
- Mixtral of Experts
  Paper • 2401.04088 • Published • 158
- Mistral 7B
  Paper • 2310.06825 • Published • 47

- KwaiYiiMath: Technical Report
  Paper • 2310.07488 • Published • 2
- Forward-Backward Reasoning in Large Language Models for Mathematical Verification
  Paper • 2308.07758 • Published • 4
- Natural Language Embedded Programs for Hybrid Language Symbolic Reasoning
  Paper • 2309.10814 • Published • 3
- MathCoder: Seamless Code Integration in LLMs for Enhanced Mathematical Reasoning
  Paper • 2310.03731 • Published • 29

- Moral Foundations of Large Language Models
  Paper • 2310.15337 • Published • 1
- Specific versus General Principles for Constitutional AI
  Paper • 2310.13798 • Published • 2
- Contrastive Preference Learning: Learning from Human Feedback without RL
  Paper • 2310.13639 • Published • 25
- RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback
  Paper • 2309.00267 • Published • 48

- Text-to-3D using Gaussian Splatting
  Paper • 2309.16585 • Published • 31
- FP8-LM: Training FP8 Large Language Models
  Paper • 2310.18313 • Published • 33
- Zephyr: Direct Distillation of LM Alignment
  Paper • 2310.16944 • Published • 123
- Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models
  Paper • 2312.06585 • Published • 28

- Text2Control3D: Controllable 3D Avatar Generation in Neural Radiance Fields using Geometry-Guided Text-to-Image Diffusion Model
  Paper • 2309.03550 • Published • 11
- Memory Augmented Language Models through Mixture of Word Experts
  Paper • 2311.10768 • Published • 17
- GAIA: a benchmark for General AI Assistants
  Paper • 2311.12983 • Published • 188
- GPT4Motion: Scripting Physical Motions in Text-to-Video Generation via Blender-Oriented GPT Planning
  Paper • 2311.12631 • Published • 13