cagatay odabasi

cagatayodabasi

cagbal

AI & ML interests

None yet

Recent Activity

liked a dataset 26 days ago

allenai/pixmo-points

upvoted a collection 27 days ago

PixMo

liked a model about 1 month ago

microsoft/OmniParser

View all activity

Organizations

None yet

cagatayodabasi's activity

upvoted a collection 27 days ago

PixMo

Collection

A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 9 items • Updated Nov 27, 2024 • 53

upvoted a paper about 2 months ago

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Paper • 2411.10440 • Published Nov 15, 2024 • 111

upvoted a collection 4 months ago

Theia

Collection

Distilling Diverse Vision Foundation Models for Robot Learning • 6 items • Updated Sep 30, 2024 • 9

upvoted an article 4 months ago

Article

Metric and Relative Monocular Depth Estimation: An Overview. Fine-Tuning Depth Anything V2 👐 📚

•

Jul 10, 2024

• 44

upvoted 2 papers 4 months ago

Sapiens: Foundation for Human Vision Models

Paper • 2408.12569 • Published Aug 22, 2024 • 89

3D-VLA: A 3D Vision-Language-Action Generative World Model

Paper • 2403.09631 • Published Mar 14, 2024 • 7

upvoted a collection 5 months ago

Minitron

Collection

A family of compressed models obtained via pruning and knowledge distillation • 12 items • Updated 22 days ago • 60

upvoted 6 papers 5 months ago

Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks

Paper • 2408.03615 • Published Aug 7, 2024 • 30

Achieving Human Level Competitive Robot Table Tennis

Paper • 2408.03906 • Published Aug 7, 2024 • 26

upvoted an article 5 months ago

Article

Vision Language Models Explained

Apr 11, 2024

• 238

upvoted a paper about 1 year ago

Zero-Shot Metric Depth with a Field-of-View Conditioned Diffusion Model

Paper • 2312.13252 • Published Dec 20, 2023 • 27