nicholasKluge
committed on
Update README.md
README.md
CHANGED
@@ -45,8 +45,7 @@ co2_eq_emissions:
 ---
 # TeenyTinyLlama-460m-Chat-awq
 
-**Note: This model is a quantized version of [TeenyTinyLlama-460m
-
+**Note: This model is a quantized version of [TeenyTinyLlama-460m](https://huggingface.co/nicholasKluge/TeenyTinyLlama-460m). Quantization was performed using [AutoAWQ](https://github.com/casper-hansen/AutoAWQ), allowing this version to be 80% lighter, 20% faster, and with almost no performance loss. A GPU is required to run the AWQ-quantized models.**
 TeenyTinyLlama is a pair of small foundational models trained in Brazilian Portuguese.
 
 This repository contains a version of [TeenyTinyLlama-460m](https://huggingface.co/nicholasKluge/TeenyTinyLlama-460m) (`TeenyTinyLlama-460m-Chat`) fine-tuned on the [Instruct-Aira Dataset version 2.0](https://huggingface.co/datasets/nicholasKluge/instruct-aira-dataset-v2).