zamroni111 commited on
Commit
49b8e8a
·
verified ·
1 Parent(s): d7264eb

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -3
README.md CHANGED
@@ -15,17 +15,18 @@ meta-llama/Meta-Llama-3.1-8B-Instruct quantized to ONNX GenAI INT4 with Microsof
15
 
16
  ### Model Description
17
  meta-llama/Meta-Llama-3.1-8B-Instruct quantized to ONNX GenAI INT4 with Microsoft DirectML optimization
 
18
  https://onnxruntime.ai/docs/genai/howto/install.html#directml
19
 
20
  Created using ONNX Runtime GenAI's builder.py
 
21
  https://raw.githubusercontent.com/microsoft/onnxruntime-genai/main/src/python/py/models/builder.py
22
 
23
  INT4 accuracy level: FP32 (float32)
 
24
  8-bit quantization for MoE layers
25
 
26
  - **Developed by:** Mochamad Aris Zamroni
27
- - **Funded by [optional]:** [More Information Needed]
28
- - **Shared by [optional]:** [More Information Needed]
29
  - **Model type:** [More Information Needed]
30
  - **Language(s) (NLP):** [More Information Needed]
31
  - **License:** [More Information Needed]
@@ -33,7 +34,7 @@ INT4 accuracy level: FP32 (float32)
33
 
34
  ### Model Sources [optional]
35
 
36
- <!-- Provide the basic links for the model. -->
37
 
38
  - **Repository:** [More Information Needed]
39
  - **Paper [optional]:** [More Information Needed]
 
15
 
16
  ### Model Description
17
  meta-llama/Meta-Llama-3.1-8B-Instruct quantized to ONNX GenAI INT4 with Microsoft DirectML optimization
18
+
19
  https://onnxruntime.ai/docs/genai/howto/install.html#directml
20
 
21
  Created using ONNX Runtime GenAI's builder.py
22
+
23
  https://raw.githubusercontent.com/microsoft/onnxruntime-genai/main/src/python/py/models/builder.py
24
 
25
  INT4 accuracy level: FP32 (float32)
26
+
27
  8-bit quantization for MoE layers
28
 
29
  - **Developed by:** Mochamad Aris Zamroni
 
 
30
  - **Model type:** [More Information Needed]
31
  - **Language(s) (NLP):** [More Information Needed]
32
  - **License:** [More Information Needed]
 
34
 
35
  ### Model Sources [optional]
36
 
37
+ https://huggingface.co/meta-llama/Meta-Llama-3.1-8B-Instruct
38
 
39
  - **Repository:** [More Information Needed]
40
  - **Paper [optional]:** [More Information Needed]