zamroni111
committed on
Update README.md
README.md
CHANGED
@@ -18,7 +18,7 @@ meta-llama/Meta-Llama-3.1-8B-Instruct quantized to ONNX GenAI INT4 with Microsof
 https://onnxruntime.ai/docs/genai/howto/install.html#directml
 
 Created using ONNX Runtime GenAI's builder.py
-https://raw.githubusercontent.com/microsoft/onnxruntime-genai/main/src/python/py/models/builder.py
+https://raw.githubusercontent.com/microsoft/onnxruntime-genai/main/src/python/py/models/builder.py
 
 INT4 accuracy level: FP32 (float32)
 8-bit quantization for MoE layers