leafspark's picture
readme: fix imatrix info
dd1a343 verified
|
raw
history blame
390 Bytes
---
license: llama3.1
tags:
- gguf
- llama3
- llama
pipeline_tag: text-generation
---
# Meta-Llama-3.1-405B-Instruct-GGUF
Low bit quantizations of Meta's Llama 3.1 405B Instruct model. Quantized from ollama q4_0 GGUF.
**Quants:**
- Q2_K
- (imatrix)
- Q3_K_M
- Q3_K_S
- Q3_K_L
- Q4_K_M
- Q4_0
- Q4_K_S
## imatrix
Generated from Q2_K quant.
imatrix calibration data: `groups_merged.txt`