Collections

Discover the best community collections!

Collections including paper arxiv:2412.04432
video LM
Collection by about 4 hours ago
Video
Collection by 4 days ago
Unified MLLM
Unified model that generate Text, Image, Video
Cognition
Perception and abstraction. Each modality is tokenized and embedded into vectors for model to comprehend.
VisionLM
Collection by 1 day ago
video
Collection by about 15 hours ago
daily papers
Collection by 13 days ago