From Imitation to Refinement -- Residual RL for Precise Visual Assembly Paper • 2407.16677 • Published Jul 23, 2024 • 1
Repeat After Me: Transformers are Better than State Space Models at Copying Paper • 2402.01032 • Published Feb 1, 2024 • 22
Hierarchical reinforcement learning with natural language subgoals Paper • 2309.11564 • Published Sep 20, 2023
ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent Paper • 2312.10003 • Published Dec 15, 2023 • 37