Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces Paper • 2412.14171 • Published Dec 18, 2024 • 24
google/siglip-so400m-patch14-384 Zero-Shot Image Classification • Updated Sep 26, 2024 • 3.2M • 444
timm/ViT-SO400M-14-SigLIP-384 Zero-Shot Image Classification • Updated Oct 27, 2023 • 35.6k • 80