A collection of AIMv2 vision encoders that supports a number of resolutions, native resolution, and a distilled checkpoint.
-
apple/aimv2-large-patch14-224
Image Feature Extraction β’ Updated β’ 2.09k β’ 40 -
apple/aimv2-huge-patch14-224
Image Feature Extraction β’ Updated β’ 88 β’ 7 -
apple/aimv2-1B-patch14-224
Image Feature Extraction β’ Updated β’ 99 β’ 4 -
apple/aimv2-3B-patch14-224
Image Feature Extraction β’ Updated β’ 49 β’ 2