|
--- |
|
license: llama3 |
|
--- |
|
Q4_K_M GGUF quant of [Reflection-Llama-3.1-70B](https://huggingface.co/mattshumer/Reflection-Llama-3.1-70B) - fixed version.<br> |
|
Runs great on 48GB VRAM, tested.<br> |
|
Ollama modelfile added - version with original system prompt - output is split into "thinking" and "output" tags.<br> |
|
If you want llama 3.1 'vanilla' experience, just remove SYSTEM from modelfile before creating ollama model.<br><br> |
|
All comments are greatly appreciated, download, test and if you appreciate my work, consider buying me my fuel: |
|
<a href="https://www.buymeacoffee.com/TeeZee" target="_blank"><img src="https://cdn.buymeacoffee.com/buttons/v2/default-yellow.png" alt="Buy Me A Coffee" style="height: 60px !important;width: 217px !important;" ></a> |