Awesome Computer Use Agents • Collection • https://github.com/ranpox/awesome-computer-use • 25 items • Updated 18 days ago • 6
OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis • Paper • 2412.19723 • Published 10 days ago • 69
Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling • Paper • 2412.05271 • Published about 1 month ago • 123
ChatGen: Automatic Text-to-Image Generation From FreeStyle Chatting • Paper • 2411.17176 • Published Nov 26, 2024 • 23
RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning • Paper • 2410.02089 • Published Oct 2, 2024 • 12
OS-ATLAS: A Foundation Action Model for Generalist GUI Agents • Paper • 2410.23218 • Published Oct 30, 2024 • 46
What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective • Paper • 2410.23743 • Published Oct 31, 2024 • 59
Precise and Dexterous Robotic Manipulation via Human-in-the-Loop Reinforcement Learning • Paper • 2410.21845 • Published Oct 29, 2024 • 13
AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant • Paper • 2410.18603 • Published Oct 24, 2024 • 32
Qwen2-VL • Collection • Vision-language model series based on Qwen2 • 16 items • Updated Dec 6, 2024 • 186
Neural Amortized Inference for Nested Multi-agent Reasoning • Paper • 2308.11071 • Published Aug 21, 2023 • 3
LLaMAX: Scaling Linguistic Horizons of LLM by Enhancing Translation Capabilities Beyond 100 Languages • Paper • 2407.05975 • Published Jul 8, 2024 • 35
A Survey of Neural Code Intelligence: Paradigms, Advances and Beyond • Paper • 2403.14734 • Published Mar 21, 2024 • 21
Make Prompt-based Black-Box Tuning Colorful: Boosting Model Generalization from Three Orthogonal Perspectives • Paper • 2305.08088 • Published May 14, 2023 • 1
KS-Lottery: Finding Certified Lottery Tickets for Multilingual Language Models • Paper • 2402.02801 • Published Feb 5, 2024 • 1