Update README.md
README.md
CHANGED
@@ -89,22 +89,22 @@ This release marks the one-year anniversary of SauerkrautLM, showcasing our most
 ## Evaluation
 
 **AGIEVAL**
-![SauerkrautLM-v2-14b-SFT-AGIEVAL](https://vago-solutions.ai/wp-content/uploads/2024/11/
+![SauerkrautLM-v2-14b-SFT-AGIEVAL](https://vago-solutions.ai/wp-content/uploads/2024/11/SauerkrautLM-v2-14b-DPO-AGIEVAL.png "SauerkrautLM-v2-14b-SFT-AGIEVAL")
 
 **GPT4ALL**
-![SauerkrautLM-v2-14b-SFT-GPT4ALL](https://vago-solutions.ai/wp-content/uploads/2024/11/
+![SauerkrautLM-v2-14b-SFT-GPT4ALL](https://vago-solutions.ai/wp-content/uploads/2024/11/SauerkrautLM-v2-14b-DPO-GPT4ALL.png "SauerkrautLM-v2-14b-SFT-GPT4ALL")
 
 **TRUTHFULQA**
-![SauerkrautLM-v2-14b-SFT-TRUTHFULQA](https://vago-solutions.ai/wp-content/uploads/2024/11/
+![SauerkrautLM-v2-14b-SFT-TRUTHFULQA](https://vago-solutions.ai/wp-content/uploads/2024/11/SauerkrautLM-v2-14b-DPO-TRUTHFULQA.png "SauerkrautLM-v2-14b-SFT-TRUTHFULQA")
 
 **OPENLEADERBOARD 2**
-![SauerkrautLM-v2-14b-SFT-OPENLEADERBOARD](https://vago-solutions.ai/wp-content/uploads/2024/11/
+![SauerkrautLM-v2-14b-SFT-OPENLEADERBOARD](https://vago-solutions.ai/wp-content/uploads/2024/11/SauerkrautLM-v2-14b-DPO-OPENLEADERBOARD.png "SauerkrautLM-v2-14b-SFT-OPENLEADERBOARD")
 
 **MMLU 5-shot**
-![SauerkrautLM-v2-14b-SFT-MMLU-5shot](https://vago-solutions.ai/wp-content/uploads/2024/11/
+![SauerkrautLM-v2-14b-SFT-MMLU-5shot](https://vago-solutions.ai/wp-content/uploads/2024/11/SauerkrautLM-v2-14b-DPO-MMLU-5shot.png "SauerkrautLM-v2-14b-SFT-MMLU-5shot")
 
 **Berkeley Function Calling Leaderboard**
-![SauerkrautLM-v2-14b-SFT-BERKELEY](https://vago-solutions.ai/wp-content/uploads/2024/11/
+![SauerkrautLM-v2-14b-SFT-BERKELEY](https://vago-solutions.ai/wp-content/uploads/2024/11/SauerkrautLM-v2-14b-DPO-BERKELEY.png "SauerkrautLM-v2-14b-SFT-BERKELEY")
 
 Please note that our benchmark results in absolute numbers may differ from the Hugging Face Leaderboard due to variations in benchmark evaluation pipelines. However, the relative differences remain consistent.
 