Statuo
/

LemonKunoichiWizardv3_8bpw

Text Generation

text-generation-inference

Inference Endpoints

8-bit precision

Model card Files Files and versions Community

Statuo commited on May 18, 2024

Commit

200775d

·

verified ·

1 Parent(s): 461d89b

Update README.md

Files changed (1) hide show

README.md +25 -1

README.md CHANGED Viewed

@@ -1,4 +1,28 @@
 ---
 base_model:
 - SanjiWatsuki/Kunoichi-DPO-v2-7B
 - dreamgen/WizardLM-2-7B
@@ -42,4 +66,4 @@ models:
       weight: 0.6
 merge_method: linear
 dtype: float16
-```

 ---
+{}
+---
+# Lemon Kunoichi Wizard - 7b
+![LemonKunoichiWizard](https://files.catbox.moe/eivabp.png)
+[Base Model](https://huggingface.co/Statuo/LemonKunoichiWizardV3/), [4bpw](https://huggingface.co/Statuo/LemonKunoichiWizardv3_4bpw), [6bpw](https://huggingface.co/Statuo/LemonKunoichiWizardv3_6bpw), [8bpw](https://huggingface.co/Statuo/LemonKunoichiWizardv3_8bpw)
+The Quanted versions come with the measurement files in case you want to do your own quants.
+A merge of three models, LemonadeRP-4.5.3, Kunoichi-DPO-v2, and WizardLM-2. I used Lemonade as a base with Kunoichi being the second biggest influence and WizardLM-2 for logic capabilities.
+The end result is a Roleplay-focused model with great character card inference. I ran 4 merges at varying values to see which provided the most accurate output to a character cards quirk, with this v3 version being the winner out of the four.
+## Context Template - Alpaca
+Alpaca preset seems to work well with your own System Prompt.
+## Context Size - 8192
+The model loads at 8192 on my end, but theoretically it should be able to go up to 32k. Not that it'll be coherent at 32k. Most models based on Mistral like this end up being - at best - 12k context size for coherent output. I only tested at 8k which is where the base models tend to shine. YMMV otherwise.
+---
 base_model:
 - SanjiWatsuki/Kunoichi-DPO-v2-7B
 - dreamgen/WizardLM-2-7B
       weight: 0.6
 merge_method: linear
 dtype: float16
+```