- CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing
  Paper • 2305.11738 • Published • 8
- Shepherd: A Critic for Language Model Generation
  Paper • 2308.04592 • Published • 31
- CriticBench: Benchmarking LLMs for Critique-Correct Reasoning
  Paper • 2402.14809 • Published • 3
- DRLC: Reinforcement Learning with Dense Rewards from LLM Critic
  Paper • 2401.07382 • Published • 2

Collections including paper arxiv:2308.04592

- Measuring the Effects of Data Parallelism on Neural Network Training
  Paper • 1811.03600 • Published • 2
- Adafactor: Adaptive Learning Rates with Sublinear Memory Cost
  Paper • 1804.04235 • Published • 2
- EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks
  Paper • 1905.11946 • Published • 3
- Yi: Open Foundation Models by 01.AI
  Paper • 2403.04652 • Published • 62

- Detecting Pretraining Data from Large Language Models
  Paper • 2310.16789 • Published • 10
- Let's Synthesize Step by Step: Iterative Dataset Synthesis with Large Language Models by Extrapolating Errors from Small Models
  Paper • 2310.13671 • Published • 18
- AutoMix: Automatically Mixing Language Models
  Paper • 2310.12963 • Published • 14
- An Emulator for Fine-Tuning Large Language Models using Small Language Models
  Paper • 2310.12962 • Published • 14

- Internet-Augmented Dialogue Generation
  Paper • 2107.07566 • Published • 2
- Multi-hop Question Answering via Reasoning Chains
  Paper • 1910.02610 • Published • 2
- LaMDA: Language Models for Dialog Applications
  Paper • 2201.08239 • Published • 4
- WebGPT: Browser-assisted question-answering with human feedback
  Paper • 2112.09332 • Published • 2

- Language Modeling Is Compression
  Paper • 2309.10668 • Published • 83
- Baichuan 2: Open Large-scale Language Models
  Paper • 2309.10305 • Published • 19
- Chain-of-Verification Reduces Hallucination in Large Language Models
  Paper • 2309.11495 • Published • 37
- LMDX: Language Model-based Document Information Extraction and Localization
  Paper • 2309.10952 • Published • 65

- Large Language Models as Optimizers
  Paper • 2309.03409 • Published • 75
- One Wide Feedforward is All You Need
  Paper • 2309.01826 • Published • 31
- Self-Alignment with Instruction Backtranslation
  Paper • 2308.06259 • Published • 41
- Shepherd: A Critic for Language Model Generation
  Paper • 2308.04592 • Published • 31