Updating model files
README.md
CHANGED
@@ -2,6 +2,17 @@
 license: other
 inference: false
 ---
+<div style="width: 100%;">
+<img src="https://i.imgur.com/EBdldam.jpg" alt="TheBlokeAI" style="width: 100%; min-width: 400px; display: block; margin: auto;">
+</div>
+<div style="display: flex; justify-content: space-between; width: 100%;">
+<div style="display: flex; flex-direction: column; align-items: flex-start;">
+<p><a href="https://discord.gg/UBgz4VXf">Chat & support: my new Discord server</a></p>
+</div>
+<div style="display: flex; flex-direction: column; align-items: flex-end;">
+<p><a href="https://www.patreon.com/TheBlokeAI">Want to contribute? Patreon coming soon!</a></p>
+</div>
+</div>
 # Vicuna 13B 1.1 GPTQ 4bit 128g
 
 This is a 4-bit GPTQ version of the [Vicuna 13B 1.1 model](https://huggingface.co/lmsys/vicuna-13b-delta-v1.1).
@@ -21,7 +32,7 @@ I have the following Vicuna 1.1 repositories available:
 **13B models:**
 * [Unquantized 13B 1.1 model for GPU - HF format](https://huggingface.co/TheBloke/vicuna-13B-1.1-HF)
 * [GPTQ quantized 4bit 13B 1.1 for GPU - `safetensors` and `pt` formats](https://huggingface.co/TheBloke/vicuna-13B-1.1-GPTQ-4bit-128g)
-
+
 **7B models:**
 * [Unquantized 7B 1.1 model for GPU - HF format](https://huggingface.co/TheBloke/vicuna-7B-1.1-HF)
 * [GPTQ quantized 4bit 7B 1.1 for GPU - `safetensors` and `pt` formats](https://huggingface.co/TheBloke/vicuna-7B-1.1-GPTQ-4bit-128g)
@@ -114,6 +125,17 @@ Then link that into `text-generation-webui/repositories` as described above.
 
 Or just use `vicuna-13B-1.1-GPTQ-4bit-128g.compat.no-act-order.pt` as mentioned above, which should work without any upgrades to text-generation-webui.
 
+## Want to support my work?
+
+I've had a lot of people ask if they can contribute. I love providing models and helping people, but it is starting to rack up pretty big cloud computing bills.
+
+So if you're able and willing to contribute, it'd be most gratefully received and will help me to keep providing models, and work on various AI projects.
+
+Donaters will get priority support on any and all AI/LLM/model questions, and I'll gladly quantise any model you'd like to try.
+
+* Patreon: coming soon! (just awaiting approval)
+* Ko-Fi: https://ko-fi.com/TheBlokeAI
+* Discord: https://discord.gg/UBgz4VXf
 # Vicuna Model Card
 
 ## Model details
@@ -152,4 +174,4 @@ A preliminary evaluation of the model quality is conducted by creating a set of
 
 ## Major updates of weights v1.1
 - Refactor the tokenization and separator. In Vicuna v1.1, the separator has been changed from `"###"` to the EOS token `"</s>"`. This change makes it easier to determine the generation stop criteria and enables better compatibility with other libraries.
-- Fix the supervised fine-tuning loss computation for better model quality.
+- Fix the supervised fine-tuning loss computation for better model quality.