IconicAI
/

NeuralHermes-2.5-Mistral-7B-exl2-5bpw

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

anokas commited on Nov 29, 2023

Commit

ab31d64

·

1 Parent(s): 4120dc8

Update README.md

Files changed (1) hide show

README.md +15 -0

README.md CHANGED Viewed

@@ -10,6 +10,7 @@ tags:
 - distillation
 - dpo
 - rlhf
 license: apache-2.0
 language:
 - en
@@ -17,6 +18,20 @@ datasets:
 - mlabonne/chatml_dpo_pairs
 ---
 <center><img src="https://i.imgur.com/qIhaFNM.png"></center>
 # NeuralHermes 2.5 - Mistral 7B

 - distillation
 - dpo
 - rlhf
+- exl2
 license: apache-2.0
 language:
 - en
 - mlabonne/chatml_dpo_pairs
 ---
+EXL2 quantisation of NeuralHermes-2.5-Mistral-7B, for use with ExLLamaV2.
+[Original model](https://huggingface.co/mlabonne/NeuralHermes-2.5-Mistral-7B) by @mlabonne.
+**Model size:** 4.6GB (3x reduction), 5 bits-per-weight average, 6bpw on head.
+**Calibration Data:** Wikitext [(parquet)](https://huggingface.co/datasets/wikitext/blob/refs%2Fconvert%2Fparquet/wikitext-2-v1/train/0000.parquet)
+**Command:** `python convert.py -i convert/NeuralHermes-2.5-Mistral-7B -c convert/0000.parquet -o convert/temp2 -cf convert/nh-5bpw -b 5.0 -hb 6`
+Layer measurements are provided in `measurement.json`` for further quantisation.
+---
 <center><img src="https://i.imgur.com/qIhaFNM.png"></center>
 # NeuralHermes 2.5 - Mistral 7B