squeeze-ai-lab
committed on
Update README.md
README.md
CHANGED
@@ -15,7 +15,7 @@ For more details please check out our [paper](https://arxiv.org/abs/2401.18079.p
 
 ## Model description
 
-Quantizer file for running DBRX with
+Quantizer file for running DBRX with 4-bit KV cache using KVQuant.
 
 * **Base Model:** [DBRX-Instruct](https://www.databricks.com/blog/introducing-dbrx-new-state-art-open-llm)
 * **Bitwidth:** 4-bit