Maximizing Alignment with Minimal Feedback: Efficiently Learning Rewards for Visuomotor Robot Policy Alignment Paper • 2412.04835 • Published Dec 6, 2024 • 2