Update README.md
Browse files
README.md
CHANGED
@@ -7,7 +7,7 @@ This model is an improved version for Korean, based on the [Qwen2-72B-Instruct](
|
|
7 |
|
8 |
|
9 |
### LogicKor Benchmark (24.07.31)
|
10 |
-
* The following benchmark ranks are based on 1-shot evaluation.
|
11 |
| Rank | Model | Reasoning | Math | Writing | Coding | Understanding | Grammar | Singleturn | Multiturn | Total | Parameters |
|
12 |
|------|-------|-----------|-------|--------|--------|-------|---------|-----------|-----------|-------|---------|
|
13 |
| 1 | openai/gpt-4o-2024-05-13 | 9.21 | 8.71 | 9.64 | 9.78 | 9.64 | 9.50 | 9.33 | 9.50 | 9.41 | ? |
|
@@ -20,7 +20,6 @@ This model is an improved version for Korean, based on the [Qwen2-72B-Instruct](
|
|
20 |
|
21 |
### KMMLU Benchmark
|
22 |
* [HAERAE-HUB/KMMLU](https://huggingface.co/datasets/HAERAE-HUB/KMMLU) benchmark accuracy score.
|
23 |
-
|
24 |
| Category |Qwen2-72B kor-dpo| Qwen2-72B | Questions |
|
25 |
|-----------------|-----------------|------------|------------|
|
26 |
| HUMSS | 0.63 | 0.63 | 5130 |
|
|
|
7 |
|
8 |
|
9 |
### LogicKor Benchmark (24.07.31)
|
10 |
+
* [The following benchmark](https://lk.instruct.kr/) ranks are based on 1-shot evaluation.
|
11 |
| Rank | Model | Reasoning | Math | Writing | Coding | Understanding | Grammar | Singleturn | Multiturn | Total | Parameters |
|
12 |
|------|-------|-----------|-------|--------|--------|-------|---------|-----------|-----------|-------|---------|
|
13 |
| 1 | openai/gpt-4o-2024-05-13 | 9.21 | 8.71 | 9.64 | 9.78 | 9.64 | 9.50 | 9.33 | 9.50 | 9.41 | ? |
|
|
|
20 |
|
21 |
### KMMLU Benchmark
|
22 |
* [HAERAE-HUB/KMMLU](https://huggingface.co/datasets/HAERAE-HUB/KMMLU) benchmark accuracy score.
|
|
|
23 |
| Category |Qwen2-72B kor-dpo| Qwen2-72B | Questions |
|
24 |
|-----------------|-----------------|------------|------------|
|
25 |
| HUMSS | 0.63 | 0.63 | 5130 |
|