Zhoues's picture

2 9 3

Zhoues

Zhoues

·

https://github.com/Zhoues

Zhoues

AI & ML interests

None yet

Recent Activity

authored a paper about 1 month ago

Code-as-Monitor: Constraint-aware Visual Programming for Reactive and Proactive Robotic Failure Detection

commented a paper about 1 month ago

Code-as-Monitor: Constraint-aware Visual Programming for Reactive and Proactive Robotic Failure Detection

upvoted a paper about 1 month ago

MV-Adapter: Multi-view Consistent Image Generation Made Easy

View all activity

Organizations

None yet

Zhoues's activity

upvoted 4 papers about 1 month ago

MV-Adapter: Multi-view Consistent Image Generation Made Easy

Paper • 2412.03632 • Published Dec 4, 2024 • 23

Code-as-Monitor: Constraint-aware Visual Programming for Reactive and Proactive Robotic Failure Detection

Paper • 2412.04455 • Published Dec 5, 2024 • 37

MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation

Paper • 2412.03558 • Published Dec 4, 2024 • 15

VLSBench: Unveiling Visual Leakage in Multimodal Safety

Paper • 2411.19939 • Published Nov 29, 2024 • 9

upvoted a paper 2 months ago

WorldSimBench: Towards Video Generation Models as World Simulators

Paper • 2410.18072 • Published Oct 23, 2024 • 18

upvoted a paper 5 months ago

TrackGo: A Flexible and Efficient Method for Controllable Video Generation

Paper • 2408.11475 • Published Aug 21, 2024 • 17

upvoted 2 papers 10 months ago

MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control

Paper • 2403.12037 • Published Mar 18, 2024 • 1

MP5: A Multi-modal Open-ended Embodied System in Minecraft via Active Perception

Paper • 2312.07472 • Published Dec 12, 2023 • 2

upvoted a paper 11 months ago

From GPT-4 to Gemini and Beyond: Assessing the Landscape of MLLMs on Generalizability, Trustworthiness and Causality through Four Modalities

Paper • 2401.15071 • Published Jan 26, 2024 • 35