PaliGemma 2: A Family of Versatile VLMs for Transfer Paper • 2412.03555 • Published Dec 4, 2024 • 121
PaliGemma 2: A Family of Versatile VLMs for Transfer Paper • 2412.03555 • Published Dec 4, 2024 • 121
PaliGemma 2: A Family of Versatile VLMs for Transfer Paper • 2412.03555 • Published Dec 4, 2024 • 121
Improving fine-grained understanding in image-text pre-training Paper • 2401.09865 • Published Jan 18, 2024 • 16
A Large-scale Study of Representation Learning with the Visual Task Adaptation Benchmark Paper • 1910.04867 • Published Oct 1, 2019
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale Paper • 2010.11929 • Published Oct 22, 2020 • 7
Big Transfer (BiT): General Visual Representation Learning Paper • 1912.11370 • Published Dec 24, 2019 • 1
Knowledge distillation: A good teacher is patient and consistent Paper • 2106.05237 • Published Jun 9, 2021
Image Captioners Are Scalable Vision Learners Too Paper • 2306.07915 • Published Jun 13, 2023 • 11
Scaling Vision Transformers to 22 Billion Parameters Paper • 2302.05442 • Published Feb 10, 2023 • 1