zamroni111
/

Meta-Llama-3.1-8B-Instruct-ONNX-DirectML-GenAI-INT4

Text Generation

Model card Files Files and versions Community

zamroni111 commited on Sep 11, 2024

Commit

49b8e8a

·

verified ·

1 Parent(s): d7264eb

Update README.md

Files changed (1) hide show

README.md +4 -3

README.md CHANGED Viewed

@@ -15,17 +15,18 @@ meta-llama/Meta-Llama-3.1-8B-Instruct quantized to ONNX GenAI INT4 with Microsof
 ### Model Description
 meta-llama/Meta-Llama-3.1-8B-Instruct quantized to ONNX GenAI INT4 with Microsoft DirectML optimization
 https://onnxruntime.ai/docs/genai/howto/install.html#directml
 Created using ONNX Runtime GenAI's builder.py
 https://raw.githubusercontent.com/microsoft/onnxruntime-genai/main/src/python/py/models/builder.py
 INT4 accuracy level: FP32 (float32)
 8-bit quantization for MoE layers
 - **Developed by:** Mochamad Aris Zamroni
-- **Funded by [optional]:** [More Information Needed]
-- **Shared by [optional]:** [More Information Needed]
 - **Model type:** [More Information Needed]
 - **Language(s) (NLP):** [More Information Needed]
 - **License:** [More Information Needed]
@@ -33,7 +34,7 @@ INT4 accuracy level: FP32 (float32)
 ### Model Sources [optional]
-<!-- Provide the basic links for the model. -->
 - **Repository:** [More Information Needed]
 - **Paper [optional]:** [More Information Needed]

 ### Model Description
 meta-llama/Meta-Llama-3.1-8B-Instruct quantized to ONNX GenAI INT4 with Microsoft DirectML optimization
 https://onnxruntime.ai/docs/genai/howto/install.html#directml
 Created using ONNX Runtime GenAI's builder.py
 https://raw.githubusercontent.com/microsoft/onnxruntime-genai/main/src/python/py/models/builder.py
 INT4 accuracy level: FP32 (float32)
 8-bit quantization for MoE layers
 - **Developed by:** Mochamad Aris Zamroni
 - **Model type:** [More Information Needed]
 - **Language(s) (NLP):** [More Information Needed]
 - **License:** [More Information Needed]
 ### Model Sources [optional]
+https://huggingface.co/meta-llama/Meta-Llama-3.1-8B-Instruct
 - **Repository:** [More Information Needed]
 - **Paper [optional]:** [More Information Needed]