license: llama3.1
library_name: ggml
# Meta-Llama-3.1-405B-Instruct-GGUF
Low-bit quantizations of Meta's Llama 3.1 405B Instruct model, requantized from the Ollama q4_0 GGUF.
**Quants:**
- Q2_K
- (imatrix)
- Q3_K_M
- Q3_K_S
- Q3_K_L
- Q4_K_M
- Q4_0
- Q4_K_S
## imatrix
Experimental. The model is force-quantized to iq1_m, an imatrix is generated from that quant, and the model is quantized to iq1_m again; this second quant is used to generate the final imatrix for all quants.
imatrix calibration data: `groups_merged.dat`
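
The two-pass procedure above could be sketched roughly as follows. This is a minimal illustration, not the exact commands used for these files. It assumes llama.cpp's `llama-quantize` and `llama-imatrix` tools, that the second iq1_m pass starts again from the source GGUF and uses the first imatrix, and that requantizing is enabled with `--allow-requantize`. All file names except `groups_merged.dat` are hypothetical, and the helper `imatrix_pipeline` is invented for this example. The script only builds and prints command lines; it runs nothing. Stock llama.cpp builds may refuse an iq1_m quant without an imatrix, and this card does not say how that first step was forced.

```python
import shlex
from typing import List


def imatrix_pipeline(src: str, calib: str, final_types: List[str]) -> List[List[str]]:
    """Build (but do not run) the commands for a two-pass imatrix workflow.

    Assumption: each quantization pass starts from `src`; the card does not
    say whether the second iq1_m pass reuses the source or the first quant.
    """
    quantize = ["llama-quantize", "--allow-requantize"]
    cmds: List[List[str]] = []

    # Pass 1: force-quantize the source to IQ1_M with no imatrix.
    cmds.append(quantize + [src, "pass1-iq1_m.gguf", "IQ1_M"])
    # Generate a first imatrix from the pass-1 quant.
    cmds.append(["llama-imatrix", "-m", "pass1-iq1_m.gguf", "-f", calib, "-o", "imatrix-pass1.dat"])

    # Pass 2: quantize to IQ1_M again, this time guided by the first imatrix.
    cmds.append(quantize + ["--imatrix", "imatrix-pass1.dat", src, "pass2-iq1_m.gguf", "IQ1_M"])
    # Generate the final imatrix from the pass-2 quant.
    cmds.append(["llama-imatrix", "-m", "pass2-iq1_m.gguf", "-f", calib, "-o", "imatrix-final.dat"])

    # Produce each published quant type with the final imatrix.
    for qtype in final_types:
        cmds.append(quantize + ["--imatrix", "imatrix-final.dat", src, f"out-{qtype}.gguf", qtype])
    return cmds


# Hypothetical source file name; print the commands for inspection.
for cmd in imatrix_pipeline("source-q4_0.gguf", "groups_merged.dat", ["Q2_K", "Q3_K_M"]):
    print(shlex.join(cmd))
```

Building the commands as lists keeps the order of the passes easy to inspect before anything is executed on a model of this size.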