zamroni111/Meta-Llama-3.1-8B-Instruct-ONNX-DirectML-GenAI-INT4

Text Generation

zamroni111 committed on Sep 11, 2024

Commit d7264eb (verified) · 1 Parent(s): 452fed3

Update README.md

Files changed (1)

README.md +1 -1

README.md CHANGED

@@ -18,7 +18,7 @@ meta-llama/Meta-Llama-3.1-8B-Instruct quantized to ONNX GenAI INT4 with Microsof
 https://onnxruntime.ai/docs/genai/howto/install.html#directml
 Created using ONNX Runtime GenAI's builder.py
-https://raw.githubusercontent.com/microsoft/onnxruntime-genai/main/src/python/py/models/builder.py -o builder.py
+https://raw.githubusercontent.com/microsoft/onnxruntime-genai/main/src/python/py/models/builder.py
 INT4 accuracy level: FP32 (float32)
 8-bit quantization for MoE layers
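
The README lines in this diff name the script and the settings used to produce the model but not the invocation itself. The sketch below is one plausible reconstruction, not the author's actual command. It assumes that the removed `-o builder.py` suffix was a leftover from a `curl` download, that builder.py accepts the `-m`, `-o`, `-p`, `-e` and `--extra_options` flags described in the ONNX Runtime GenAI documentation, and that `int4_accuracy_level=1` selects FP32. The helper names `fetch_builder` and `builder_command` and the output directory are hypothetical.

```python
import sys
import urllib.request
from pathlib import Path

# URL taken verbatim from the README line changed in this commit.
BUILDER_URL = (
    "https://raw.githubusercontent.com/microsoft/onnxruntime-genai/"
    "main/src/python/py/models/builder.py"
)


def fetch_builder(dest="builder.py"):
    """Download builder.py to `dest` (hypothetical helper) and return its path."""
    path, _headers = urllib.request.urlretrieve(BUILDER_URL, dest)
    return Path(path)


def builder_command(model_id, output_dir, script="builder.py"):
    """Assemble an argv list for builder.py (hypothetical helper).

    The flag names and the accuracy-level value are assumptions based on
    the ONNX Runtime GenAI docs; check them against the script you download.
    """
    return [
        sys.executable, script,
        "-m", model_id,        # source model on the Hugging Face Hub
        "-o", output_dir,      # builder.py's own -o: the output folder
        "-p", "int4",          # target precision
        "-e", "dml",           # DirectML execution provider
        "--extra_options", "int4_accuracy_level=1",  # assumed to mean FP32
    ]


if __name__ == "__main__":
    fetch_builder()
    cmd = builder_command(
        "meta-llama/Meta-Llama-3.1-8B-Instruct",
        "./llama-3.1-8b-int4-dml",  # hypothetical output directory
    )
    # Print the command for review rather than running it.
    print(" ".join(cmd))
```

Note that `-o` here is builder.py's output-folder flag, which is unrelated to the `-o builder.py` that this commit removes from the URL line.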