TeeZee
/

Reflection-Llama-3.1-70B-GGUF

Inference Endpoints

Model card Files Files and versions Community

Reflection-Llama-3.1-70B-GGUF / README.md

TeeZee's picture

Update README.md

0e83de4 verified 5 months ago

|

history blame contribute delete

752 Bytes

	---
	license: llama3
	---
	Q4_K_M GGUF quant of [Reflection-Llama-3.1-70B](https://huggingface.co/mattshumer/Reflection-Llama-3.1-70B) - fixed version.<br>
	Runs great on 48GB VRAM, tested.<br>
	Ollama modelfile added - version with original system prompt - output is split into "thinking" and "output" tags.<br>
	If you want llama 3.1 'vanilla' experience, just remove SYSTEM from modelfile before creating ollama model.<br><br>
	All comments are greatly appreciated, download, test and if you appreciate my work, consider buying me my fuel:
	<a href="https://www.buymeacoffee.com/TeeZee" target="_blank"><img src="https://cdn.buymeacoffee.com/buttons/v2/default-yellow.png" alt="Buy Me A Coffee" style="height: 60px !important;width: 217px !important;" ></a>