cicdatopea
committed on
Update README.md
README.md CHANGED
@@ -26,7 +26,7 @@ On CUDA devices, the computation dtype is typically FP16 for int4, which may le
 from transformers import AutoModelForCausalLM, AutoTokenizer
 import torch
 
-quantized_model_dir = "/
+quantized_model_dir = "OPEA/DeepSeek-V3-int4-sym-gptq-inc"
 
 ## directly use device_map='auto' if you have enough GPUs
 max_memory = {i: "75GiB" for i in range(7)}
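
For context, the change points `quantized_model_dir` at the Hub repo id instead of a local path. Below is a minimal sketch of how the snippet shown in this hunk is typically completed to load the quantized checkpoint; the `from_pretrained` arguments and the tokenizer call are assumptions, not part of this diff.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer
import torch

quantized_model_dir = "OPEA/DeepSeek-V3-int4-sym-gptq-inc"

## directly use device_map='auto' if you have enough GPUs
max_memory = {i: "75GiB" for i in range(7)}

# Assumed completion of the snippet: shard the quantized model across GPUs,
# capping per-device usage with the max_memory map defined above.
model = AutoModelForCausalLM.from_pretrained(
    quantized_model_dir,
    torch_dtype=torch.float16,
    device_map="auto",
    max_memory=max_memory,
)
tokenizer = AutoTokenizer.from_pretrained(quantized_model_dir)
```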