leafspark/Meta-Llama-3.1-405B-Instruct-GGUF


leafspark committed on Jul 24, 2024

Commit 362d372 · verified · 1 Parent(s): af45a20

readme: add model card

Files changed (1)

README.md +24 -3

@@ -1,3 +1,24 @@
----
-license: llama3.1
----

+---
+license: llama3.1
+library_name: ggml
+---
+# Meta-Llama-3.1-405B-Instruct-GGUF
+Low-bit quantizations of Meta's Llama 3.1 405B Instruct model, quantized from the Ollama q4_0 GGUF.
+**Quants:**
+- Q2_K
+- (imatrix)
+- Q3_K_M
+- Q3_K_S
+- Q3_K_L
+- Q4_K_M
+- Q4_0
+- Q4_K_S
+imatrix calibration data: groups_merged.dat
+## imatrix
+Experimental: the model is force-quantized to iq1_m, an imatrix is generated from that quant, the model is quantized to iq1_m again with that imatrix, and the second quant is used to generate the final imatrix for all quants.
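
The two-pass imatrix flow described in the README's imatrix note could be sketched roughly as below. This is only an illustration, not the author's actual script: it assumes the llama.cpp `llama-quantize` and `llama-imatrix` tools, every intermediate file name is hypothetical, and the commit does not say how the first imatrix-free iq1_m quant was "forced" (a stock build may refuse that step). The code only builds the command lines and does not run them.

```python
from pathlib import Path


def imatrix_pipeline(src_gguf, calib_file, workdir="work"):
    """Build argv lists for a two-pass imatrix flow (nothing is executed).

    All intermediate file names below are hypothetical placeholders.
    """
    work = Path(workdir)
    pass1 = work / "pass1-iq1_m.gguf"
    imat1 = work / "pass1.imatrix"
    pass2 = work / "pass2-iq1_m.gguf"
    final = work / "final.imatrix"
    return [
        # 1. First iq1_m quant without an imatrix. Assumption: a stock
        #    llama-quantize may reject this; the README only says "force".
        ["llama-quantize", "--allow-requantize",
         str(src_gguf), str(pass1), "IQ1_M"],
        # 2. Generate a first imatrix from that quant and the calibration data.
        ["llama-imatrix", "-m", str(pass1), "-f", str(calib_file),
         "-o", str(imat1)],
        # 3. Quantize to iq1_m again, this time guided by the first imatrix.
        ["llama-quantize", "--allow-requantize", "--imatrix", str(imat1),
         str(src_gguf), str(pass2), "IQ1_M"],
        # 4. Generate the final imatrix from the second quant.
        ["llama-imatrix", "-m", str(pass2), "-f", str(calib_file),
         "-o", str(final)],
    ]


def quantize_command(src_gguf, imatrix, out_gguf, quant_type):
    """Build the argv for one released quant using the final imatrix."""
    return ["llama-quantize", "--allow-requantize", "--imatrix", str(imatrix),
            str(src_gguf), str(out_gguf), quant_type]
```

Each returned list can be passed to `subprocess.run(cmd, check=True)` in order. `--allow-requantize` is included because the source here is already a q4_0 GGUF rather than an f16 or f32 model.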