zhangyunfeng's picture

5 19

zhangyunfeng

yunfeng

·

yunfengsay

AI & ML interests

None yet

Recent Activity

liked a Space 14 days ago

Qwen/QVQ-72B-preview

upvoted an article about 1 month ago

seemore: Implement a Vision Language Model from Scratch

View all activity

Organizations

None yet

yunfeng's activity

upvoted an article about 1 month ago

Article

seemore: Implement a Vision Language Model from Scratch

By

•

Jun 23, 2024

• 69

upvoted a collection 3 months ago

Multimodal RAG

10 items • Updated Sep 5, 2024 • 25

upvoted a collection 9 months ago

MGM

Official model collection for the paper "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models" • 13 items • Updated May 3, 2024 • 47

upvoted a collection 11 months ago

From screenshots to HTML

WebSight is a dataset of 823,000 HTML/CSS codes representing synthetically generated English websites, each accompanied by a corresponding screenshot. • 4 items • Updated Apr 15, 2024 • 19

upvoted a paper 12 months ago

Lumiere: A Space-Time Diffusion Model for Video Generation

Paper • 2401.12945 • Published Jan 23, 2024 • 86