Zesen Cheng

ClownRat

AI & ML interests

Multi-modal foundation models; segmentation, detection, and tracking

Recent Activity

upvoted a paper 11 days ago

HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

ClownRat's activity

updated a model 2 days ago

DAMO-NLP-SG/VL3-SigLIP-NaViT

Feature Extraction • Updated 2 days ago

liked a dataset 6 days ago

DAMO-NLP-SG/multimodal_textbook

Updated 10 days ago • 9.9k • 122

upvoted 3 papers 11 days ago

New activity in DAMO-NLP-SG/VideoLLaMA2.1-7B-AV 12 days ago

Some weights of Videollama2Qwen2ForCausalLM were not initialized from the model checkpoint at ./VideoLLaMA2.1-7B-AV and are newly initialized:

#4 opened about 1 month ago by zybbmn

authored a paper 15 days ago

VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM

Paper • 2501.00599 • Published 21 days ago • 41

upvoted a paper 15 days ago

Explanatory Instructions: Towards Unified Vision Tasks Understanding and Zero-shot Generalization

Paper • 2412.18525 • Published 28 days ago • 70

updated a model 15 days ago

ClownRat/VideoLLaMA2.1-7B-16F

Text Generation • Updated 15 days ago • 51

upvoted 2 papers 15 days ago

VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM

Paper • 2501.00599 • Published 21 days ago • 41

2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining

Paper • 2501.00958 • Published 19 days ago • 97

updated 2 models 27 days ago

ClownRat/resnet-50-torchvision

Updated 27 days ago • 1.82k

ClownRat/mask2former-resnet-50-coco-instance

Updated 27 days ago • 920

updated a model 29 days ago

ClownRat/resnet-101-torchvision

Updated 29 days ago • 8

updated a collection about 1 month ago

Mask2Former

Collection

2 items • Updated Dec 20, 2024

liked a dataset about 1 month ago

ClownRat/COCO2017-Instance

Viewer • Updated Dec 11, 2024 • 123k • 45 • 1

updated a model about 1 month ago

ClownRat/mask2former-resnet-101-coco-instance

Updated Dec 17, 2024 • 34

updated a dataset about 1 month ago

ClownRat/COCO2017-Instance

Viewer • Updated Dec 11, 2024 • 123k • 45 • 1

upvoted a paper about 1 month ago

Towards Universal Soccer Video Understanding

Paper • 2412.01820 • Published Dec 2, 2024 • 9