Siteng Huang

huangsiteng

https://kyonhuang.top/

AI & ML interests

vision-language models

Recent Activity

authored a paper 28 days ago

Accelerating Diffusion Transformers with Token-wise Feature Caching

authored a paper 28 days ago

Rethinking Token Reduction in MLLMs: Towards a Unified Paradigm for Training-Free Acceleration

authored a paper 29 days ago

CARP: Visuomotor Policy Learning via Coarse-to-Fine Autoregressive Prediction

Organizations

None yet

huangsiteng's activity

commented on a paper 29 days ago

CARP: Visuomotor Policy Learning via Coarse-to-Fine Autoregressive Prediction

Paper • 2412.06782 • Published 30 days ago • 6

commented on a paper about 1 month ago

Rethinking Token Reduction in MLLMs: Towards a Unified Paradigm for Training-Free Acceleration

Paper • 2411.17686 • Published Nov 26, 2024 • 19

commented on a paper 4 months ago

PiTe: Pixel-Temporal Alignment for Large Video-Language Model

Paper • 2409.07239 • Published Sep 11, 2024 • 12