Wentao Ma
tonymwt
AI & ML interests
LLM VISION ROBOTICS
Recent Activity
upvoted
a
paper
about 2 months ago
VISTA: Enhancing Long-Duration and High-Resolution Video Understanding
by Video Spatiotemporal Augmentation
liked
a model
8 months ago
microsoft/Phi-3-vision-128k-instruct
upvoted
an
article
8 months ago
PaliGemma – Google's Cutting-Edge Open Vision Language Model
Organizations
None yet
models
None public yet
datasets
None public yet