1 201 13

Chan Kim

chanmuzi

chanmuzi

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

MLLM-as-a-Judge for Image Safety without Human Labeling

upvoted a paper 3 days ago

Xmodel-2 Technical Report

upvoted a paper 4 days ago

Explanatory Instructions: Towards Unified Vision Tasks Understanding and Zero-shot Generalization

View all activity

Organizations

chanmuzi's activity

upvoted a paper 2 days ago

MLLM-as-a-Judge for Image Safety without Human Labeling

Paper • 2501.00192 • Published 6 days ago • 22

upvoted a paper 3 days ago

Xmodel-2 Technical Report

Paper • 2412.19638 • Published 10 days ago • 23

upvoted a paper 4 days ago

Explanatory Instructions: Towards Unified Vision Tasks Understanding and Zero-shot Generalization

Paper • 2412.18525 • Published 13 days ago • 62

upvoted a paper 8 days ago

A Silver Bullet or a Compromise for Full Attention? A Comprehensive Study of Gist Token-based Context Compression

Paper • 2412.17483 • Published 14 days ago • 29

upvoted a paper 11 days ago

Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning

Paper • 2412.15797 • Published 17 days ago • 16

upvoted a paper 12 days ago

Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization

Paper • 2412.17739 • Published 13 days ago • 37

upvoted 2 papers 13 days ago

Outcome-Refining Process Supervision for Code Generation

Paper • 2412.15118 • Published 17 days ago • 19

B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners

Paper • 2412.17256 • Published 14 days ago • 44

upvoted a paper 18 days ago

TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks

Paper • 2412.14161 • Published 18 days ago • 48

upvoted 2 papers 19 days ago

Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models

Paper • 2412.09645 • Published 26 days ago • 35

Byte Latent Transformer: Patches Scale Better Than Tokens

Paper • 2412.09871 • Published 24 days ago • 81

upvoted a paper 20 days ago

Wonderful Matrices: Combining for a More Efficient and Effective Foundation Model Architecture

Paper • 2412.11834 • Published 21 days ago • 6

upvoted 2 papers about 1 month ago

LongKey: Keyphrase Extraction for Long Documents

Paper • 2411.17863 • Published Nov 26, 2024 • 11

Star Attention: Efficient LLM Inference over Long Sequences

Paper • 2411.17116 • Published Nov 26, 2024 • 48

upvoted a paper about 2 months ago

Cut Your Losses in Large-Vocabulary Language Models

Paper • 2411.09009 • Published Nov 13, 2024 • 43

upvoted a paper 2 months ago

Aligning Large Language Models via Self-Steering Optimization

Paper • 2410.17131 • Published Oct 22, 2024 • 21

upvoted 4 papers 3 months ago

SeerAttention: Learning Intrinsic Sparse Attention in Your LLMs

Paper • 2410.13276 • Published Oct 17, 2024 • 26

JudgeBench: A Benchmark for Evaluating LLM-based Judges

Paper • 2410.12784 • Published Oct 16, 2024 • 43

Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation

Paper • 2410.13848 • Published Oct 17, 2024 • 32

MixEval-X: Any-to-Any Evaluations from Real-World Data Mixtures

Paper • 2410.13754 • Published Oct 17, 2024 • 75