File size: 1,782 Bytes
f88837f
580f1f3
f88837f
 
 
580f1f3
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
f88837f
580f1f3
f88837f
580f1f3
f88837f
580f1f3
 
 
 
 
 
 
f88837f
580f1f3
f88837f
580f1f3
f88837f
6be9b1b
580f1f3
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
---
license: apache-2.0
---


<style>
    img{
    user-select: none;
    transition: all 0.2s ease;
    border-radius: .5rem;
  }
    img:hover{
    transform: rotate(2deg);
    filter: invert(100%);
  }
@import url('https://fonts.googleapis.com/css2?family=Vollkorn:ital,wght@0,400..900;1,400..900&display=swap');
</style>

<div style="background-color: transparent; border-radius: .5rem; padding: 2rem; font-family: monospace; font-size: .85rem; text-align: justify;">
  
![cubby](https://huggingface.co/appvoid/cubby/resolve/main/cubby.webp)

This is the latest iteration as an effort to make arco as good on arc as it can. So far it improved a little.

#### prompt

there is no prompt intentionally set.


#### benchmarks

zero-shot results from state-of-the-art small language models

| Parameters | Model                          | MMLU  | ARC-C | HellaSwag | PIQA   | Winogrande | Average |
| -----------|--------------------------------|-------|-------|-----------|--------|------------|---------|
| 0.5b       | danube 3                       | 24.81| 36.18| 60.46| 73.78 | 61.01     | 51.25  |
| 0.5b       | arco                           |**26.17**|37.29|62.88|74.37|**62.27**|52.60|
| 0.5b       | arco 2                         |25.51|38.82|63.02|**74.70**|61.25|**52.66**|
| 0.5b       | arco 2º                        |25.47|**38.99**|**63.03**|**74.70**|61.01|52.64|
#### supporters

<a href="https://ko-fi.com/appvoid" target="_blank"><img src="https://cdn.buymeacoffee.com/buttons/v2/default-yellow.png" alt="Buy Me A Coffee" style="height: 34px !important; margin-top: -4px;width: 128px !important; filter: contrast(2) grayscale(100%) brightness(100%);" ></a>

### trivia

arco seems to keep improving on the same 3 benchmarks, reached its limit though.
</div>