ichigoberry
committed on
add eval results
README.md
CHANGED
@@ -19,6 +19,23 @@ MonarchPipe-7B-slerp is a merge of the following models using [LazyMergekit](htt
 * [OpenPipe/mistral-ft-optimized-1227](https://huggingface.co/OpenPipe/mistral-ft-optimized-1227)
 * [mlabonne/AlphaMonarch-7B](https://huggingface.co/mlabonne/AlphaMonarch-7B)
 
+
+## Eval
+
+### Nous
+
+Eval results from the Nous benchmark suite (performed using LLM AutoEval).
+
+| Model | Average | AGIEval | GPT4All | TruthfulQA | Bigbench |
+|---|---:|---:|---:|---:|---:|
+| [**MonarchPipe-7B-slerp**](https://huggingface.co/ichigoberry/MonarchPipe-7B-slerp) [📄](https://gist.github.com/tosh/3d93f4e3d2c65935bf2f4f9a46791352)| 58.77| **46.12**| 74.89| 66.59| 47.49|
+| [AlphaMonarch-7B](https://huggingface.co/mlabonne/AlphaMonarch-7B) [📄](https://gist.github.com/mlabonne/1d33c86824b3a11d2308e36db1ba41c1) | **62.74** | 45.37 | **77.01** | **78.39** | **50.2** |
+| [Monarch-7B](https://huggingface.co/mlabonne/Monarch-7B) [📄](https://gist.github.com/mlabonne/0b8d057c5ece41e0290580a108c7a093) | 62.68 | 45.48 | 77.07 | 78.04 | 50.14 |
+| [teknium/OpenHermes-2.5-Mistral-7B](https://huggingface.co/teknium/OpenHermes-2.5-Mistral-7B) [📄](https://gist.github.com/mlabonne/88b21dd9698ffed75d6163ebdc2f6cc8) | 52.42 | 42.75 | 72.99 | 52.99 | 40.94 |
+| [mlabonne/NeuralHermes-2.5-Mistral-7B](https://huggingface.co/mlabonne/NeuralHermes-2.5-Mistral-7B) [📄](https://gist.github.com/mlabonne/14687f1eb3425b166db511f31f8e66f6) | 53.51 | 43.67 | 73.24 | 55.37 | 41.76 |
+
+
+
 ## 🧩 Configuration
 
 ```yaml