appvoid
/

arco-2.1

Safetensors

llama

Model card Files Files and versions Community

appvoid commited on 27 days ago

Commit

580f1f3

verified ·

1 Parent(s): f88837f

Update README.md

Browse files

Files changed (1) hide show

README.md +37 -33

README.md CHANGED Viewed

@@ -1,43 +1,47 @@
 ---
-base_model:
-- appvoid/arco-2
-- appvoid/arco-exp-12
-- appvoid/arco-2-reasoning-20k
-- appvoid/text-arco
-library_name: transformers
-tags:
-- mergekit
-- merge
 ---
-# merge
-This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
-## Merge Details
-### Merge Method
-This model was merged using the [Model Stock](https://arxiv.org/abs/2403.19522) merge method using [appvoid/arco-exp-12](https://huggingface.co/appvoid/arco-exp-12) as a base.
-### Models Merged
-The following models were included in the merge:
-* [appvoid/arco-2](https://huggingface.co/appvoid/arco-2)
-* [appvoid/arco-2-reasoning-20k](https://huggingface.co/appvoid/arco-2-reasoning-20k)
-* [appvoid/text-arco](https://huggingface.co/appvoid/text-arco)
-### Configuration
-The following YAML configuration was used to produce this model:
-```yaml
-models:
-    - model: appvoid/arco-2-reasoning-20k
-    - model: appvoid/arco-2
-    - model: appvoid/text-arco
-merge_method: model_stock
-base_model: appvoid/arco-exp-12
-normalize: false
-int8_mask: true
-dtype: float16
-```

 ---
+license: apache-2.0
 ---
+<style>
+    img{
+    user-select: none;
+    transition: all 0.2s ease;
+    border-radius: .5rem;
+  }
+    img:hover{
+    transform: rotate(2deg);
+    filter: invert(100%);
+  }
+@import url('https://fonts.googleapis.com/css2?family=Vollkorn:ital,wght@0,400..900;1,400..900&display=swap');
+</style>
+<div style="background-color: transparent; border-radius: .5rem; padding: 2rem; font-family: monospace; font-size: .85rem; text-align: justify;">
+![cubby](https://huggingface.co/appvoid/cubby/resolve/main/cubby.webp)
+This is the latest iteration as an effort to make arco as good on arc as it can. So far it improved a little.
+#### prompt
+there is no prompt intentionally set.
+#### benchmarks
+zero-shot results from state-of-the-art small language models
+| Parameters | Model                          | MMLU  | ARC-C | HellaSwag | PIQA   | Winogrande | Average |
+| -----------|--------------------------------|-------|-------|-----------|--------|------------|---------|
+| 0.5b       | danube 3                       | 24.81| 36.18| 60.46| 73.78 | 61.01     | 51.25  |
+| 0.5b       | arco                           |**26.17**|37.29|62.88|74.37|**62.27**|52.60|
+| 0.5b       | arco 2                         |25.51|38.82|63.02|**74.70**|61.25|**52.66**|
+| 0.5b       | arco 2º                        |25.47|**38.99**|**63.03**|**74.70**|61.01|52.64|
+#### supporters
+<a href="https://ko-fi.com/appvoid" target="_blank"><img src="https://cdn.buymeacoffee.com/buttons/v2/default-yellow.png" alt="Buy Me A Coffee" style="height: 34px !important; margin-top: -4px;width: 128px !important; filter: contrast(2) grayscale(100%) brightness(100%);" ></a>
+### trivia
+arco seems to keep improving on the same 3 benchmarks, hellaswag reached its limit though.
+</div>