leafspark/Meta-Llama-3.1-405B-Instruct-GGUF


	---
	license: llama3.1
	library_name: ggml
	---

	# Meta-Llama-3.1-405B-Instruct-GGUF

	Low-bit quantizations of Meta's Llama 3.1 405B Instruct model, requantized from the Ollama Q4_0 GGUF.

	Quants:
	- Q2_K
	- (imatrix)
	- Q3_K_M
	- Q3_K_S
	- Q3_K_L
	- Q4_K_M
	- Q4_0
	- Q4_K_S

	## imatrix

	Experimental. The model is first force-quantized to IQ1_M, and an imatrix is generated from that model. The imatrix is then used to quantize to IQ1_M again, and this second IQ1_M model is used to generate the final imatrix for all quants.

	imatrix calibration data: `groups_merged.dat`
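
The two-pass imatrix procedure described above could look roughly like the following sequence of llama.cpp invocations. This is only a sketch under stated assumptions, not the exact commands used for this repository: the helper `imatrix_pipeline`, every file name other than `groups_merged.dat`, and the choice of llama.cpp's `llama-quantize` and `llama-imatrix` tools are illustrative, and the card does not say how the first IQ1_M pass was forced. The function only builds the command lines; it does not run them.

```python
from pathlib import Path


def imatrix_pipeline(src, calib, quants, workdir="work"):
    """Build argv lists for a two-pass imatrix flow (hypothetical helper).

    src    : already-quantized source GGUF (the card says Q4_0)
    calib  : calibration data file passed to llama-imatrix
    quants : final quant types to produce with the final imatrix
    """
    work = Path(workdir)
    # All intermediate and output file names below are made up for illustration.
    pass1 = work / "pass1-IQ1_M.gguf"
    pass2 = work / "pass2-IQ1_M.gguf"
    imat1 = work / "imatrix-pass1.dat"
    imat_final = work / "imatrix-final.dat"

    def quantize_cmd(out, qtype, imatrix_file=None):
        # --allow-requantize is needed because the source is already quantized.
        cmd = ["llama-quantize", "--allow-requantize"]
        if imatrix_file is not None:
            cmd += ["--imatrix", str(imatrix_file)]
        return cmd + [str(src), str(out), qtype]

    def imatrix_cmd(model, out):
        return ["llama-imatrix", "-m", str(model), "-f", str(calib), "-o", str(out)]

    cmds = [
        # Pass 1: IQ1_M without an imatrix. Stock builds may refuse this;
        # how it was forced is not stated in the card.
        quantize_cmd(pass1, "IQ1_M"),
        imatrix_cmd(pass1, imat1),
        # Pass 2: IQ1_M again, this time guided by the first imatrix.
        quantize_cmd(pass2, "IQ1_M", imat1),
        # The imatrix from the second IQ1_M model is the final one.
        imatrix_cmd(pass2, imat_final),
    ]
    # Final quants, all using the final imatrix.
    cmds += [quantize_cmd(work / f"model-{q}.gguf", q, imat_final) for q in quants]
    return cmds


if __name__ == "__main__":
    for cmd in imatrix_pipeline("source-q4_0.gguf", "groups_merged.dat", ["Q2_K", "Q3_K_M"]):
        print(" ".join(cmd))
```

Returning the commands as lists keeps the sketch free of side effects; each list could be passed to `subprocess.run` once the paths and tool names have been checked against a local llama.cpp build.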