OPEA
/

DeepSeek-V3-int4-sym-gptq-inc

4-bit precision

Model card Files Files and versions Community

cicdatopea commited on 21 days ago

Commit

1b23955

·

verified ·

1 Parent(s): 58077b4

Update README.md

Files changed (1) hide show

README.md +1 -7

README.md CHANGED Viewed

@@ -12,7 +12,7 @@ base_model:
 This model is an int4 model with group_size 128 and symmetric quantization of [deepseek-ai/DeepSeek-V3](https://huggingface.co/deepseek-ai/DeepSeek-V3) generated by [intel/auto-round](https://github.com/intel/auto-round) algorithm.
-**Loading the model in Transformers can be quite slow, especially with CUDA devices(30m-1hours). Consider using an alternative serving framework. ** However, we have not tested it on other frameworks due to limited cuda resources.
 Please follow the license of the original model.
@@ -160,13 +160,7 @@ Generated: DeepSeek Artificial Intelligence Co., Ltd. (referred to as "DeepSeek"
 Prompt: hello
 Generated: Hello! How can I assist you today? 😊
 """
 ~~~

 This model is an int4 model with group_size 128 and symmetric quantization of [deepseek-ai/DeepSeek-V3](https://huggingface.co/deepseek-ai/DeepSeek-V3) generated by [intel/auto-round](https://github.com/intel/auto-round) algorithm.
+**Loading the model in Transformers can be quite slow, especially with CUDA devices(30m-1hours). Consider using an alternative serving framework.** However, we have not tested it on other frameworks due to limited cuda resources.
 Please follow the license of the original model.
 Prompt: hello
 Generated: Hello! How can I assist you today? 😊
 """
 ~~~