Twins: Revisiting the Design of Spatial Attention in Vision Transformers Paper • 2104.13840 • Published Apr 28, 2021
YOLOv6: A Single-Stage Object Detection Framework for Industrial Applications Paper • 2209.02976 • Published Sep 7, 2022
ROME: Robustifying Memory-Efficient NAS via Topology Disentanglement and Gradient Accumulation Paper • 2011.11233 • Published Nov 23, 2020
Norm Tweaking: High-performance Low-bit Quantization of Large Language Models Paper • 2309.02784 • Published Sep 6, 2023 • 1
MobileVLM : A Fast, Reproducible and Strong Vision Language Assistant for Mobile Devices Paper • 2312.16886 • Published Dec 28, 2023 • 19
MobileVLM V2: Faster and Stronger Baseline for Vision Language Model Paper • 2402.03766 • Published Feb 6, 2024 • 13
VisionLLaMA: A Unified LLaMA Interface for Vision Tasks Paper • 2403.00522 • Published Mar 1, 2024 • 44
FPTQ: Fine-grained Post-Training Quantization for Large Language Models Paper • 2308.15987 • Published Aug 30, 2023 • 1
Lenna: Language Enhanced Reasoning Detection Assistant Paper • 2312.02433 • Published Dec 5, 2023 • 2
MixPath: A Unified Approach for One-shot Neural Architecture Search Paper • 2001.05887 • Published Jan 16, 2020 • 1