MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval • Paper • 2412.14475 • Published 18 days ago • 52
WavePulse: Real-time Content Analytics of Radio Livestreams • Paper • 2412.17998 • Published 13 days ago • 9
Bridging the Data Provenance Gap Across Text, Speech and Video • Paper • 2412.17847 • Published 18 days ago • 7
No More Adam: Learning Rate Scaling at Initialization is All You Need • Paper • 2412.11768 • Published 21 days ago • 41
2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining • Paper • 2501.00958 • Published 4 days ago • 75