readme: update model card
Browse files
README.md
CHANGED
@@ -29,6 +29,8 @@ Quantized with llama.cpp [b3449](https://github.com/ggerganov/llama.cpp/releases
|
|
29 |
- Q4_0
|
30 |
- Q4_K_S
|
31 |
|
|
|
|
|
32 |
## imatrix
|
33 |
|
34 |
Generated from Q2_K quant.
|
|
|
29 |
- Q4_0
|
30 |
- Q4_K_S
|
31 |
|
32 |
+
For higher quality quantizations (q4+), please refer to [nisten/meta-405b-instruct-cpu-optimized-gguf](https://huggingface.co/nisten/meta-405b-instruct-cpu-optimized-gguf).
|
33 |
+
|
34 |
## imatrix
|
35 |
|
36 |
Generated from Q2_K quant.
|