Kuo-Hsin Tu's picture

118 45

Kuo-Hsin Tu

dapumptu

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

A3: Android Agent Arena for Mobile GUI Agents

upvoted a paper 3 days ago

2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining

upvoted a paper 3 days ago

ProgCo: Program Helps Self-Correction of Large Language Models

View all activity

Organizations

None yet

dapumptu's activity

upvoted 3 papers 3 days ago

A3: Android Agent Arena for Mobile GUI Agents

Paper • 2501.01149 • Published 4 days ago • 20

2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining

Paper • 2501.00958 • Published 4 days ago • 75

ProgCo: Program Helps Self-Correction of Large Language Models

Paper • 2501.01264 • Published 4 days ago • 22

liked 3 models 3 days ago

lianghsun/Llama-3.2-Taiwan-3B

Text Generation • Updated 5 days ago • 510 • 11

lianghsun/Llama-3.2-Taiwan-1B-Instruct

Text Generation • Updated 7 days ago • 22 • 2

lianghsun/Llama-3.2-Taiwan-3B-Instruct

Text Generation • Updated 5 days ago • 56 • 11

upvoted 4 papers 6 days ago

Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

Paper • 2412.18619 • Published 21 days ago • 49

HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation

Paper • 2412.21199 • Published 6 days ago • 9

Training Software Engineering Agents and Verifiers with SWE-Gym

Paper • 2412.21139 • Published 6 days ago • 16

Efficiently Serving LLM Reasoning Programs with Certaindex

Paper • 2412.20993 • Published 7 days ago • 29

upvoted 6 papers 7 days ago

In Case You Missed It: ARC 'Challenge' Is Not That Challenging

Paper • 2412.17758 • Published 13 days ago • 16

Token-Budget-Aware LLM Reasoning

Paper • 2412.18547 • Published 13 days ago • 42

Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search

Paper • 2412.18319 • Published 13 days ago • 34

YuLan-Mini: An Open Data-efficient Language Model

Paper • 2412.17743 • Published 13 days ago • 59

MMFactory: A Universal Solution Search Engine for Vision-Language Tasks

Paper • 2412.18072 • Published 13 days ago • 14

HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

Paper • 2412.18925 • Published 12 days ago • 86

updated a collection 12 days ago

agent

2 items • Updated 12 days ago

upvoted 3 papers 12 days ago

PC Agent: While You Sleep, AI Works -- A Cognitive Journey into Digital World

Paper • 2412.17589 • Published 14 days ago • 12

OpenAI o1 System Card

Paper • 2412.16720 • Published 15 days ago • 29

Diving into Self-Evolving Training for Multimodal Reasoning

Paper • 2412.17451 • Published 14 days ago • 41