Adaptive Length Image Tokenization via Recurrent Allocation Paper • 2411.02393 • Published Nov 4, 2024 • 12 • 1
Inference Optimal VLMs Need Only One Visual Token but Larger Models Paper • 2411.03312 • Published Nov 5, 2024 • 6 • 1
Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration Paper • 2410.18076 • Published Oct 23, 2024 • 4 • 2
Autoregressive Large Language Models are Computationally Universal Paper • 2410.03170 • Published Oct 4, 2024 • 1 • 1
Can Models Learn Skill Composition from Examples? Paper • 2409.19808 • Published Sep 29, 2024 • 9 • 2
DynaMo: In-Domain Dynamics Pretraining for Visuo-Motor Control Paper • 2409.12192 • Published Sep 18, 2024 • 4 • 3
Scaling Smart: Accelerating Large Language Model Pre-training with Small Model Initialization Paper • 2409.12903 • Published Sep 19, 2024 • 22 • 5
The Unreasonable Ineffectiveness of the Deeper Layers Paper • 2403.17887 • Published Mar 26, 2024 • 78 • 14