arxiv:2410.12787
Xin Li PRO
lixin4ever
AI & ML interests
Natural Language Processing, Machine Learning
Recent Activity
upvoted
a
paper
about 20 hours ago
2.5 Years in Class: A Multimodal Textbook for Vision-Language
Pretraining
liked
a dataset
about 20 hours ago
DAMO-NLP-SG/multimodal_textbook
upvoted
a
paper
about 24 hours ago
VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with
Video LLM
Organizations
spaces
2
models
None public yet
datasets
None public yet