Unifying Specialized Visual Encoders for Video Language Models Paper • 2501.01426 • Published 5 days ago • 20
AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmark Paper • 2410.03051 • Published Oct 4, 2024 • 5