denial07
/

Qwen2-72B-Instruct-kor-dpo

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

denial07 commited on Aug 3, 2024

Commit

48126df

·

verified ·

1 Parent(s): 66c7932

Update README.md

Files changed (1) hide show

README.md +1 -2

README.md CHANGED Viewed

@@ -7,7 +7,7 @@ This model is an improved version for Korean, based on the [Qwen2-72B-Instruct](
 ### LogicKor Benchmark (24.07.31)
-* The following benchmark ranks are based on 1-shot evaluation.
 | Rank | Model | Reasoning | Math  | Writing | Coding | Understanding | Grammar | Singleturn | Multiturn | Total | Parameters |
 |------|-------|-----------|-------|--------|--------|-------|---------|-----------|-----------|-------|---------|
 | 1    | openai/gpt-4o-2024-05-13 | 9.21 | 8.71 | 9.64 | 9.78 | 9.64 | 9.50 | 9.33 | 9.50 | 9.41 | ? |
@@ -20,7 +20,6 @@ This model is an improved version for Korean, based on the [Qwen2-72B-Instruct](
 ### KMMLU Benchmark
 * [HAERAE-HUB/KMMLU](https://huggingface.co/datasets/HAERAE-HUB/KMMLU) benchmark accuracy score.
   | Category        |Qwen2-72B kor-dpo| Qwen2-72B  | Questions  |
   |-----------------|-----------------|------------|------------|
   | HUMSS           |     0.63        |   0.63     | 5130       |

 ### LogicKor Benchmark (24.07.31)
+* [The following benchmark](https://lk.instruct.kr/) ranks are based on 1-shot evaluation.
 | Rank | Model | Reasoning | Math  | Writing | Coding | Understanding | Grammar | Singleturn | Multiturn | Total | Parameters |
 |------|-------|-----------|-------|--------|--------|-------|---------|-----------|-----------|-------|---------|
 | 1    | openai/gpt-4o-2024-05-13 | 9.21 | 8.71 | 9.64 | 9.78 | 9.64 | 9.50 | 9.33 | 9.50 | 9.41 | ? |
 ### KMMLU Benchmark
 * [HAERAE-HUB/KMMLU](https://huggingface.co/datasets/HAERAE-HUB/KMMLU) benchmark accuracy score.
   | Category        |Qwen2-72B kor-dpo| Qwen2-72B  | Questions  |
   |-----------------|-----------------|------------|------------|
   | HUMSS           |     0.63        |   0.63     | 5130       |