leafspark
/

Meta-Llama-3.1-405B-Instruct-GGUF

Text Generation

Inference Endpoints

Model card Files Files and versions Community

Meta-Llama-3.1-405B-Instruct-GGUF / README.md

leafspark's picture

readme: fix imatrix info

dd1a343 verified 6 months ago

|

390 Bytes

	---
	license: llama3.1
	tags:
	- gguf
	- llama3
	- llama
	pipeline_tag: text-generation
	---

	# Meta-Llama-3.1-405B-Instruct-GGUF

	Low bit quantizations of Meta's Llama 3.1 405B Instruct model. Quantized from ollama q4_0 GGUF.

	Quants:
	- Q2_K
	- (imatrix)
	- Q3_K_M
	- Q3_K_S
	- Q3_K_L
	- Q4_K_M
	- Q4_0
	- Q4_K_S

	## imatrix

	Generated from Q2_K quant.

	imatrix calibration data: `groups_merged.txt`