3 7 1

Kanzhi Cheng

cckevinn

AI & ML interests

None yet

Recent Activity

authored a paper 15 days ago

SeeClick: Harnessing GUI Grounding for Advanced Visual GUI Agents

authored a paper 15 days ago

Interactive Evolution: A Neural-Symbolic Self-Training Framework For Large Language Models

authored a paper 15 days ago

Vision-Language Models Can Self-Improve Reasoning via Reflection

View all activity

Organizations

cckevinn's activity

authored 4 papers 15 days ago

SeeClick: Harnessing GUI Grounding for Advanced Visual GUI Agents

Paper • 2401.10935 • Published Jan 17, 2024 • 4

Interactive Evolution: A Neural-Symbolic Self-Training Framework For Large Language Models

Paper • 2406.11736 • Published Jun 17, 2024 • 5

Vision-Language Models Can Self-Improve Reasoning via Reflection

Paper • 2411.00855 • Published Oct 30, 2024 • 5

OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis

Paper • 2412.19723 • Published 26 days ago • 81

upvoted a paper 20 days ago

OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis

Paper • 2412.19723 • Published 26 days ago • 81

upvoted a collection 23 days ago

OS-Genesis

Collection

11 items • Updated 16 days ago • 6

reacted to Symbol-LLM's post with 🚀🔥🔥 2 months ago

Post

983

🥳 Thrilled to introduce our recent efforts on bootstrapping VLMs for multi-modal chain-of-thought reasoning !

📕 Title: Vision-Language Models Can Self-Improve Reasoning via Reflection

🔗 Link: Vision-Language Models Can Self-Improve Reasoning via Reflection (2411.00855)

😇Takeaways:

- We found that VLMs can self-improve reasoning performance through a reflection mechanism, and importantly, this approach can scale through test-time computing.

- Evaluation on comprehensive and diverse Vision-Language reasoning tasks are included !