Update README.md
Browse files
README.md
CHANGED
@@ -55,6 +55,18 @@ datasets:
|
|
55 |
- **License:** apache-2.0
|
56 |
- **Finetuned from model :** LeroyDyer/Mixtral_AI_CyberTron_Ultra
|
57 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
58 |
|
59 |
# What does he NOT KNOW ! that is the question!
|
60 |
|
|
|
55 |
- **License:** apache-2.0
|
56 |
- **Finetuned from model :** LeroyDyer/Mixtral_AI_CyberTron_Ultra
|
57 |
|
58 |
+
[<img src="https://cdn-avatars.huggingface.co/v1/production/uploads/65d883893a52cd9bcd8ab7cf/tRsCJlHNZo1D02kBTmfy9.jpeg" width="300"/>
|
59 |
+
https://github.com/spydaz
|
60 |
+
|
61 |
+
|
62 |
+
* The Mistral-7B-Instruct-v0.2 Large Language Model (LLM) is an instruct fine-tuned version of the Mistral-7B-v0.2.
|
63 |
+
|
64 |
+
* Mistral-7B-v0.2 has the following changes compared to Mistral-7B-v0.1
|
65 |
+
|
66 |
+
* 32k context window (vs 8k context in v0.1)
|
67 |
+
* Rope-theta = 1e6
|
68 |
+
* No Sliding-Window Attention
|
69 |
+
|
70 |
|
71 |
# What does he NOT KNOW ! that is the question!
|
72 |
|