- Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models
  Paper • 2404.13013 • Published • 31
- Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing
  Paper • 2404.12253 • Published • 55
- Data-Efficient Contrastive Language-Image Pretraining: Prioritizing Data Quality over Quantity
  Paper • 2403.12267 • Published
- No More Adam: Learning Rate Scaling at Initialization is All You Need
  Paper • 2412.11768 • Published • 41
Oliver Wei
Oliver2021
AI & ML interests
None yet
Recent Activity
liked a model 2 days ago: deepseek-ai/DeepSeek-R1
liked a Space 5 days ago: LittleFrog/MatchAnything
upvoted a paper 9 days ago: VideoRAG: Retrieval-Augmented Generation over Video Corpus
Organizations
None yet
Collections
1
Models
None public yet
Datasets
None public yet