EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Diffusion Models Paper • 2310.03270 • Published Oct 5, 2023
Dual Grained Quantization: Efficient Fine-Grained Quantization for LLM Paper • 2310.04836 • Published Oct 7, 2023 • 1
Data-Free Quantization with Accurate Activation Clipping and Adaptive Batch Normalization Paper • 2204.04215 • Published Apr 8, 2022 • 1
MiniCache: KV Cache Compression in Depth Dimension for Large Language Models Paper • 2405.14366 • Published May 23, 2024 • 1
ZipVL: Efficient Large Vision-Language Models with Dynamic Token Sparsification and KV Cache Compression Paper • 2410.08584 • Published Oct 11, 2024 • 12
ZipAR: Accelerating Autoregressive Image Generation through Spatial Locality Paper • 2412.04062 • Published Dec 5, 2024 • 7
GATE OpenING: A Comprehensive Benchmark for Judging Open-ended Interleaved Image-Text Generation Paper • 2411.18499 • Published Nov 27, 2024 • 18
GATE OpenING: A Comprehensive Benchmark for Judging Open-ended Interleaved Image-Text Generation Paper • 2411.18499 • Published Nov 27, 2024 • 18
ZipVL: Efficient Large Vision-Language Models with Dynamic Token Sparsification and KV Cache Compression Paper • 2410.08584 • Published Oct 11, 2024 • 12
DragAnything: Motion Control for Anything using Entity Representation Paper • 2403.07420 • Published Mar 12, 2024 • 13