update results for slerp
Browse files
README.md
CHANGED
@@ -15,11 +15,11 @@ pipeline_tag: text-generation
|
|
15 |
|
16 |
This is the DPO version of [wenbopan/Faro-Yi-9B](https://huggingface.co/wenbopan/Faro-Yi-9B). Compared to Faro-Yi-9B and [Yi-9B-200K](https://huggingface.co/01-ai/Yi-9B-200K), the DPO model excels at many tasks, surpassing the original Yi-9B-200K by a large margin.
|
17 |
|
18 |
-
| **Metric**
|
19 |
-
|
|
20 |
-
| **Yi-9B-200K**
|
21 |
-
| **Faro-9B**
|
22 |
-
| **Faro-9B-DPO** | 66.
|
23 |
|
24 |
|
25 |
![image/png](https://cdn-uploads.huggingface.co/production/uploads/62cd3a3691d27e60db0698b0/Oa9QSbXgaYVekrYfgfaiC.png)
|
|
|
15 |
|
16 |
This is the DPO version of [wenbopan/Faro-Yi-9B](https://huggingface.co/wenbopan/Faro-Yi-9B). Compared to Faro-Yi-9B and [Yi-9B-200K](https://huggingface.co/01-ai/Yi-9B-200K), the DPO model excels at many tasks, surpassing the original Yi-9B-200K by a large margin.
|
17 |
|
18 |
+
| **Metric** | **MMLU** | **GSM8K** | **hellaswag** | **truthfulqa** | **ai2_arc** | **winogrande** | **CMMLU** |
|
19 |
+
| ----------------------- | --------- | --------- | ------------- | -------------- | ----------- | -------------- | --------- |
|
20 |
+
| **Yi-9B-200K** | 65.73 | 50.49 | 56.72 | 33.80 | 69.25 | 71.67 | 71.97 |
|
21 |
+
| **Faro-Yi-9B** | 68.80 | 63.08 | 57.28 | 40.86 | 72.58 | 71.11 | 73.28 |
|
22 |
+
| **Faro-Yi-9B-DPO** | **69.98** | **66.11** | **59.04** | **48.01** | **75.68** | **73.40** | **75.23** |
|
23 |
|
24 |
|
25 |
![image/png](https://cdn-uploads.huggingface.co/production/uploads/62cd3a3691d27e60db0698b0/Oa9QSbXgaYVekrYfgfaiC.png)
|