Quantized version?
#24
by MichielBontenbal - opened
Will quantized versions of Idefics be published? Looking forward to it!
Hi @MichielBontenbal, did you check the section https://huggingface.co/HuggingFaceM4/idefics2-8b#model-optimizations?
I am about to add more information on memory requirements for the 4-bit quantized versions. As a sneak peek: there are lots of possibilities to run inference on a chip with less than 16 GB of memory!
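For reference, a minimal sketch of what 4-bit loading could look like with `transformers` and `bitsandbytes` (an illustrative example, not the official snippet; see the model card's optimization section for the authoritative version):

```python
import torch
from transformers import AutoProcessor, AutoModelForVision2Seq, BitsAndBytesConfig

# 4-bit quantization config via bitsandbytes
quantization_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_compute_dtype=torch.float16,  # keep compute in fp16
)

processor = AutoProcessor.from_pretrained("HuggingFaceM4/idefics2-8b")
model = AutoModelForVision2Seq.from_pretrained(
    "HuggingFaceM4/idefics2-8b",
    quantization_config=quantization_config,
    device_map="auto",  # place the quantized weights on the available GPU
)
```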
VictorSanh changed discussion status to closed