haneol j. kim

HaneolKijm

https://haneol-kijm.github.io/

AI & ML interests

computer vision, diffusion models, LLM agents, deep RL

Recent Activity

upvoted a paper 2 days ago

PaSa: An LLM Agent for Comprehensive Academic Paper Search

upvoted a paper 2 days ago

Evolving Deeper LLM Thinking

upvoted a paper 2 days ago

Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training

HaneolKijm's activity

upvoted 13 papers 2 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 10 days ago • 279

Improving Video Generation with Human Feedback

Paper • 2501.13918 • Published 9 days ago • 44

SRMT: Shared Memory for Multi-agent Lifelong Pathfinding

Paper • 2501.13200 • Published 10 days ago • 62

Humanity's Last Exam

Paper • 2501.14249 • Published 9 days ago • 48

Chain-of-Retrieval Augmented Generation

Paper • 2501.14342 • Published 8 days ago • 38

Towards General-Purpose Model-Free Reinforcement Learning

Paper • 2501.16142 • Published 5 days ago • 22

Qwen2.5-1M Technical Report

Paper • 2501.15383 • Published 7 days ago • 43

Optimizing Large Language Model Training Using FP4 Quantization

Paper • 2501.17116 • Published 4 days ago • 25

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published 4 days ago • 53

Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate

Paper • 2501.17703 • Published 3 days ago • 38

upvoted a paper 15 days ago

Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps

Paper • 2501.09732 • Published 16 days ago • 66

upvoted 5 papers 17 days ago

Transformer²: Self-adaptive LLMs

Paper • 2501.06252 • Published 24 days ago • 53

VideoAuteur: Towards Long Narrative Video Generation

Paper • 2501.06173 • Published 22 days ago • 31

Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published 22 days ago • 79

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published 19 days ago • 89

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published 18 days ago • 271

upvoted a paper 22 days ago

The GAN is dead; long live the GAN! A Modern GAN Baseline

Paper • 2501.05441 • Published 23 days ago • 87