metadata
license: mit
language:
- nl
tags:
- gguf
This repository contains quantized versions of BramVanroy/fietje-2b-instruct:
-f16
(5.6GB): best quality, but largest and slowest-q8_0
(3.0GB): minimal quality loss, smaller-q5_k_m
(2.0GB): users have reported considerable quality loss in the chatq5_k_m
version so you may want to avoid it
Also available on ollama:
# defaults to f16
ollama run bramvanroy/fietje-2b-instruct
ollama run bramvanroy/fietje-2b-instruct:f16
ollama run bramvanroy/fietje-2b-instruct:q8_0
ollama run bramvanroy/fietje-2b-instruct:q5_k_m