arxiv:2408.13257
Micheal Tian
StarBurger
AI & ML interests
self-driving, computer vision, self-supervised learning
Recent Activity
upvoted a paper 1 day ago
VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction
upvoted a paper about 1 month ago
MME-Survey: A Comprehensive Survey on Evaluation of Multimodal LLMs
authored a paper about 1 month ago
MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?
Organizations
None yet
Papers
1
models
None public yet
datasets
None public yet