ajibawa-2023
commited on
Commit
·
cec263d
1
Parent(s):
804a538
Update README.md
Browse files
README.md
CHANGED
@@ -22,7 +22,7 @@ I have released the new data [Code-74k-ShareGPT](https://huggingface.co/datasets
|
|
22 |
Entire dataset was trained on 4 x A100 80GB. For 3 epoch, training took 6 days & 5 hours. DeepSpeed codebase was used for training purpose. This was trained on Llama-1 by Meta.
|
23 |
|
24 |
|
25 |
-
This is a full fine tuned model. Links for quantized models
|
26 |
|
27 |
|
28 |
**GPTQ GGUF & AWQ**
|
@@ -34,6 +34,7 @@ GGUF: [Link](https://huggingface.co/TheBloke/Code-33B-GGUF)
|
|
34 |
AWQ: [Link](https://huggingface.co/TheBloke/Code-33B-AWQ)
|
35 |
|
36 |
|
|
|
37 |
|
38 |
|
39 |
**Example Prompt:**
|
|
|
22 |
Entire dataset was trained on 4 x A100 80GB. For 3 epoch, training took 6 days & 5 hours. DeepSpeed codebase was used for training purpose. This was trained on Llama-1 by Meta.
|
23 |
|
24 |
|
25 |
+
This is a full fine tuned model. Links for quantized models are given below.
|
26 |
|
27 |
|
28 |
**GPTQ GGUF & AWQ**
|
|
|
34 |
AWQ: [Link](https://huggingface.co/TheBloke/Code-33B-AWQ)
|
35 |
|
36 |
|
37 |
+
Extremely thankful to [TheBloke](https://huggingface.co/TheBloke) for making Quantized versions of model.
|
38 |
|
39 |
|
40 |
**Example Prompt:**
|