Commit 7e7016d (verified), committed by cicdatopea · 1 parent: 2571b44

Update README.md

Files changed (1): README.md (+2 −2)
README.md CHANGED

```diff
@@ -5,7 +5,7 @@ datasets:
 
 ## Model Details
 
-This model is an int4 model with group_size 128 and symmetric quantization of [meta-llama/Llama-3.2-90B-Vision-Instruct](https://huggingface.co/meta-llama/Llama-3.2-90B-Vision-Instruct). Load the model with revision="4fd505b" to use auto_round format.
+This model is an int4 model with group_size 128 and symmetric quantization of [meta-llama/Llama-3.2-90B-Vision-Instruct](https://huggingface.co/meta-llama/Llama-3.2-90B-Vision-Instruct). Load the model with revision="64f5493" to use AutoGPTQ format.
 
 ## How To Use
 
@@ -27,7 +27,7 @@ model = MllamaForConditionalGeneration.from_pretrained(
     quantized_model_path,
     torch_dtype="auto",
     device_map="auto",
-    ##revision="4fd505b" ##auto_round format
+    ##revision="64f5493" ##AutoGPTQ format
 )
 processor = AutoProcessor.from_pretrained(quantized_model_path)
 image_url = "https://huggingface.co/datasets/huggingface/documentation-images/resolve/0052a70beed5bf71b92610a43a52df6d286cd5f3/diffusers/rabbit.jpg"
```
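Assembled from the diff context above, the README's loading step after this commit looks roughly like the sketch below. It is not run here (it would download a 90B checkpoint); `quantized_model_path` is a placeholder, and the `revision` argument is shown uncommented to illustrate how the pinned AutoGPTQ-format revision from this commit would be selected.

```python
from transformers import AutoProcessor, MllamaForConditionalGeneration

# Placeholder: replace with this model repo's id or a local path.
quantized_model_path = "path/to/quantized/model"

model = MllamaForConditionalGeneration.from_pretrained(
    quantized_model_path,
    torch_dtype="auto",
    device_map="auto",
    revision="64f5493",  # pins the AutoGPTQ-format revision, per this commit
)
processor = AutoProcessor.from_pretrained(quantized_model_path)
```

Omitting `revision` loads the repo's default branch instead; passing it pins the download to the commit that stores the AutoGPTQ-format weights.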