Training Software Engineering Agents and Verifiers with SWE-Gym Paper • 2412.21139 • Published 6 days ago • 16
Divot: Diffusion Powers Video Tokenizer for Comprehension and Generation Paper • 2412.04432 • Published Dec 5, 2024 • 14
Feather the Throttle: Revisiting Visual Token Pruning for Vision-Language Model Acceleration Paper • 2412.13180 • Published 19 days ago • 12
Emergence of Abstractions: Concept Encoding and Decoding Mechanism for In-Context Learning in Transformers Paper • 2412.12276 • Published 20 days ago • 15
AmoebaLLM: Constructing Any-Shape Large Language Models for Efficient and Instant Deployment Paper • 2411.10606 • Published Nov 15, 2024 • 1
MaestroMotif: Skill Design from Artificial Intelligence Feedback Paper • 2412.08542 • Published 26 days ago • 1
CMT: A Memory Compression Method for Continual Knowledge Learning of Large Language Models Paper • 2412.07393 • Published 27 days ago • 2
Video Token Merging for Long-form Video Understanding Paper • 2410.23782 • Published Oct 31, 2024 • 2
Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction Paper • 2412.04454 • Published Dec 5, 2024 • 57
APOLLO: SGD-like Memory, AdamW-level Performance Paper • 2412.05270 • Published about 1 month ago • 38
Moto: Latent Motion Token as the Bridging Language for Robot Manipulation Paper • 2412.04445 • Published Dec 5, 2024 • 21
Training Large Language Models to Reason in a Continuous Latent Space Paper • 2412.06769 • Published 27 days ago • 66
Efficient Long Video Tokenization via Coordinated-based Patch Reconstruction Paper • 2411.14762 • Published Nov 22, 2024 • 11
Trace is the New AutoDiff -- Unlocking Efficient Optimization of Computational Workflows Paper • 2406.16218 • Published Jun 23, 2024 • 2
Combining Induction and Transduction for Abstract Reasoning Paper • 2411.02272 • Published Nov 4, 2024 • 1
Critical Tokens Matter: Token-Level Contrastive Estimation Enhence LLM's Reasoning Capability Paper • 2411.19943 • Published Nov 29, 2024 • 56