Adding Evaluation Results
#1, opened by leaderboard-pr-bot
README.md CHANGED
@@ -200,4 +200,17 @@ ted together.
 
 With tears in their eyes and heavy hearts, they bid each other farewell, promising to keep in touch and meet again soon. And so, their epic journey came to an end. But the memories would remain with them forever, reminding
 them of the power of friendship, the beauty of nature, and the importance of discovering new worlds.
-```
+```
+# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
+Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_openaccess-ai-collective__minotaur-13b-fixed)
+
+| Metric | Value |
+|-----------------------|---------------------------|
+| Avg. | 49.57 |
+| ARC (25-shot) | 59.04 |
+| HellaSwag (10-shot) | 81.66 |
+| MMLU (5-shot) | 50.1 |
+| TruthfulQA (0-shot) | 50.36 |
+| Winogrande (5-shot) | 76.87 |
+| GSM8K (5-shot) | 13.12 |
+| DROP (3-shot) | 15.83 |
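
The `Avg.` row in the added table appears to be the unweighted mean of the seven benchmark scores. The diff does not state how it was computed, so treat that as an assumption; the sketch below only checks that the listed numbers are consistent with it. The `scores` dictionary and `average` variable are illustrative names, not part of the leaderboard tooling.

```python
# Scores copied from the table added in this PR.
scores = {
    "ARC (25-shot)": 59.04,
    "HellaSwag (10-shot)": 81.66,
    "MMLU (5-shot)": 50.1,
    "TruthfulQA (0-shot)": 50.36,
    "Winogrande (5-shot)": 76.87,
    "GSM8K (5-shot)": 13.12,
    "DROP (3-shot)": 15.83,
}

# Assumption: "Avg." is the plain arithmetic mean over all seven benchmarks.
average = sum(scores.values()) / len(scores)

print(f"{average:.2f}")  # prints 49.57, matching the Avg. row
```

Under that assumption the low GSM8K and DROP scores pull the mean well below the other five benchmarks, which all sit at 50 or above.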