denial07 committed on
Commit 48126df · verified · 1 Parent(s): 66c7932

Update README.md

  1. README.md +1 -2
README.md CHANGED
@@ -7,7 +7,7 @@ This model is an improved version for Korean, based on the [Qwen2-72B-Instruct](
 
 
  ### LogicKor Benchmark (24.07.31)
- * The following benchmark ranks are based on 1-shot evaluation.
+ * [The following benchmark](https://lk.instruct.kr/) ranks are based on 1-shot evaluation.
  | Rank | Model | Reasoning | Math | Writing | Coding | Understanding | Grammar | Singleturn | Multiturn | Total | Parameters |
  |------|-------|-----------|-------|--------|--------|-------|---------|-----------|-----------|-------|---------|
  | 1 | openai/gpt-4o-2024-05-13 | 9.21 | 8.71 | 9.64 | 9.78 | 9.64 | 9.50 | 9.33 | 9.50 | 9.41 | ? |
@@ -20,7 +20,6 @@ This model is an improved version for Korean, based on the [Qwen2-72B-Instruct](
 
  ### KMMLU Benchmark
  * [HAERAE-HUB/KMMLU](https://huggingface.co/datasets/HAERAE-HUB/KMMLU) benchmark accuracy score.
-
  | Category |Qwen2-72B kor-dpo| Qwen2-72B | Questions |
  |-----------------|-----------------|------------|------------|
  | HUMSS | 0.63 | 0.63 | 5130 |