arxiv:2410.20482
Hao Fei
scofield7419
AI & ML interests
Natural Language Processing, Vision and Language, Structural Modeling, Large Language Model
Recent Activity
upvoted
a
paper
13 days ago
Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of
Images and Videos
commented on
a paper
19 days ago
Vitron: A Unified Pixel-level Vision LLM for Understanding, Generating,
Segmenting, Editing
commented on
a paper
19 days ago
Vitron: A Unified Pixel-level Vision LLM for Understanding, Generating,
Segmenting, Editing
Organizations
None yet
Papers
20
models
None public yet
datasets
None public yet