arxiv:2410.07113
Jianshu Zhang
Sterzhang
AI & ML interests
Data-Centric AI, Multi-Modal Understanding
Recent Activity
upvoted a paper 3 days ago
2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining
commented a paper 3 days ago
2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining
upvoted a paper 3 days ago
OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis
Organizations
Papers 1
Models None public yet
Datasets 6
Sterzhang/tmp-bench • Preview • Updated • 17
Sterzhang/P-Bench-Choice • Viewer • Updated • 1.14k • 7
Sterzhang/tmp • Viewer • Updated • 65.4k • 3
Sterzhang/tmp1 • Updated • 2
Sterzhang/PVIT-3M • Viewer • Updated • 3M • 12.3k • 17
Sterzhang/image-textualization • Preview • Updated • 212 • 15