Update README.md
Browse files
README.md
CHANGED
@@ -17,3 +17,14 @@ fine-tuned on the MFANN dataset as it stands on 5/2/2024 as it is an ever changi
|
|
17 |
|
18 |
|
19 |
WHY IS MY 8B MODEL FAILING BENCHMARKS HUGGINGFACE!!!!!!!!!!!!!!!!!!!!!!!!!
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
17 |
|
18 |
|
19 |
WHY IS MY 8B MODEL FAILING BENCHMARKS HUGGINGFACE!!!!!!!!!!!!!!!!!!!!!!!!!
|
20 |
+
|
21 |
+
|
22 |
+
benchmark results for this 3b model:
|
23 |
+
|
24 |
+
64.34 <-- Average
|
25 |
+
62.63 <-- Arc
|
26 |
+
77.1 <-- HellaSwag
|
27 |
+
58.43 <-- MMLU
|
28 |
+
51.71 <-- TruthfulQA
|
29 |
+
74.66 <-- Winogrande
|
30 |
+
61.49 <-- GSM8K
|