Training Language Models to Self-Correct via Reinforcement Learning Paper โข 2409.12917 โข Published Sep 19, 2024 โข 136
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits Paper โข 2402.17764 โข Published Feb 27, 2024 โข 606
Self-Discover: Large Language Models Self-Compose Reasoning Structures Paper โข 2402.03620 โข Published Feb 6, 2024 โข 114