BramVanroy's picture
Create README.md
f794845 verified
|
raw
history blame
649 Bytes
metadata
license: mit
language:
  - nl
tags:
  - gguf

This repository contains quantized versions of BramVanroy/fietje-2b-instruct:

  • -f16 (5.6GB): best quality, but largest and slowest
  • -q8_0 (3.0GB): minimal quality loss, smaller
  • -q5_k_m (2.0GB): users have reported considerable quality loss in the chat q5_k_m version so you may want to avoid it

Also available on ollama:

# defaults to f16
ollama run bramvanroy/fietje-2b-instruct
ollama run bramvanroy/fietje-2b-instruct:f16
ollama run bramvanroy/fietje-2b-instruct:q8_0
ollama run bramvanroy/fietje-2b-instruct:q5_k_m