- Vision Transformer Adapters for Generalizable Multitask Learning
  Paper • 2308.12372 • Published
- RMT: Retentive Networks Meet Vision Transformers
  Paper • 2309.11523 • Published • 33
- DualToken-ViT: Position-aware Efficient Vision Transformer with Dual Token Fusion
  Paper • 2309.12424 • Published • 11
- PaLI-3 Vision Language Models: Smaller, Faster, Stronger
  Paper • 2310.09199 • Published • 26
Collections including paper arxiv:2310.09199