Temporal Preference Optimization for Long-Form Video Understanding • Paper • 2501.13919 • Published 8 days ago
BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature • Paper • 2501.07171 • Published 19 days ago