slimfrikha-tii
committed on
fix typo benchs
README.md
@@ -222,7 +222,7 @@ We report in the following table our internal pipeline benchmarks.
 </tr>
 <tr>
 <td>IFEval</td>
-<td>57.
+<td>57.8</td>
 <td>63.4</td>
 <td><b>78</b></td>
 </tr>
@@ -235,9 +235,9 @@ We report in the following table our internal pipeline benchmarks.
 </tr>
 <tr>
 <td>GSM8K (8-shot, COT)</td>
-<td>
-<td>
-<td><b>
+<td>76</td>
+<td>80.4</td>
+<td><b>84.6</b></td>
 </tr>
 <tr>
 <td>MATH Lvl-5 (4-shot)</td>