Commit 5b5e906 · verified · parent: 3f1badb
cicdatopea committed

update README: auto-round must be installed from source for asym quantization

Files changed (1)
  1. README.md +2 -0
README.md CHANGED
@@ -11,6 +11,8 @@ This model is an int4 model with group_size 128 and asymmetric quantization of [
 
 HPU: docker image with Gaudi Software Stack is recommended, please refer to the following script for environment setup. More details can be found in [Gaudi Guide](https://docs.habana.ai/en/latest/Installation_Guide/Bare_Metal_Fresh_OS.html#launch-docker-image-that-was-built).
 
+CUDA (must install from source): git clone https://github.com/intel/auto-round && cd auto-round && pip install -vvv --no-build-isolation -e .
+
 ```python
 from auto_round import AutoHfQuantizer ##must import
 import torch
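
For context, the snippet the diff touches is typically followed by a standard Transformers loading call. Below is a minimal sketch of that usage, assuming auto-round has been installed from source as the added line instructs; the model id `OPEA/<int4-model-repo>` is a hypothetical placeholder, not the actual repo name.

```python
from auto_round import AutoHfQuantizer  ## must import so the AutoRound quantization config is recognized
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Hypothetical placeholder: replace with the real int4 model repo id on the Hub.
model_id = "OPEA/<int4-model-repo>"

# Load the int4 (group_size 128, asymmetric) checkpoint through Transformers.
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,
    device_map="auto",
)
tokenizer = AutoTokenizer.from_pretrained(model_id)

# Run a short generation to verify the quantized model loads and decodes correctly.
prompt = "There is a girl who likes adventure,"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=50)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```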