KABI

dongguanting

https://dongguanting.github.io/

AI & ML interests

Reasoning and Alignment for Large Language Models

Recent Activity

upvoted a paper 3 days ago

Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs

upvoted a paper 3 days ago

ProgCo: Program Helps Self-Correction of Large Language Models

commented a paper 3 days ago

ProgCo: Program Helps Self-Correction of Large Language Models

dongguanting's activity

upvoted 2 papers 3 days ago

Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs

Paper • 2412.21187 • Published 6 days ago • 25

ProgCo: Program Helps Self-Correction of Large Language Models

Paper • 2501.01264 • Published 4 days ago • 22

upvoted 2 papers 6 days ago

MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval

Paper • 2412.14475 • Published 18 days ago • 52

YuLan-Mini: An Open Data-efficient Language Model

Paper • 2412.17743 • Published 13 days ago • 59

upvoted a paper 11 days ago

Diving into Self-Evolving Training for Multimodal Reasoning

Paper • 2412.17451 • Published 14 days ago • 41

upvoted a paper 13 days ago

B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners

Paper • 2412.17256 • Published 14 days ago • 44

upvoted a paper 17 days ago

Progressive Multimodal Reasoning via Active Retrieval

Paper • 2412.14835 • Published 18 days ago • 69

upvoted a collection 17 days ago

VisionLM

Collection

596 items • Updated about 2 hours ago • 39

upvoted a paper 17 days ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published 17 days ago • 335

upvoted 2 papers 18 days ago

Shepherd: A Critic for Language Model Generation

Paper • 2308.04592 • Published Aug 8, 2023 • 31

CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing

Paper • 2305.11738 • Published May 19, 2023 • 8

upvoted a collection 18 days ago

UI Agent

Collection

a collection of algorithmic agents for user interfaces/interactions and program synthesis • 236 items • Updated 2 days ago • 38

upvoted 3 papers 19 days ago

Byte Latent Transformer: Patches Scale Better Than Tokens

Paper • 2412.09871 • Published 24 days ago • 81

OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain

Paper • 2412.13018 • Published 20 days ago • 41

Multi-Dimensional Insights: Benchmarking Real-World Personalization in Large Multimodal Models

Paper • 2412.12606 • Published 20 days ago • 41

upvoted 3 papers 20 days ago

Rho-1: Not All Tokens Are What You Need

Paper • 2404.07965 • Published Apr 11, 2024 • 88

Smaller Language Models Are Better Instruction Evolvers

Paper • 2412.11231 • Published 22 days ago • 26

RetroLLM: Empowering Large Language Models to Retrieve Fine-grained Evidence within Generation

Paper • 2412.11919 • Published 21 days ago • 33

upvoted 2 papers 21 days ago

Phi-4 Technical Report

Paper • 2412.08905 • Published 25 days ago • 96

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Paper • 2412.09596 • Published 24 days ago • 92