Cautious Optimizers: Improving Training with One Line of Code Paper • 2411.16085 • Published Nov 25, 2024 • 15
AMO Sampler: Enhancing Text Rendering with Overshooting Paper • 2411.19415 • Published Nov 28, 2024 • 3
Memory-Efficient LLM Training with Online Subspace Descent Paper • 2408.12857 • Published Aug 23, 2024 • 13
Longhorn: State Space Models are Amortized Online Learners Paper • 2407.14207 • Published Jul 19, 2024 • 18