Update README.md
README.md
CHANGED
@@ -1,3 +1,15 @@
 ---
 license: apache-2.0
+datasets:
+- argilla/distilabel-intel-orca-dpo-pairs
+tags:
+- dpo
+- 13B
 ---
+
+# solarized-18B-dpo
+
+DPO'd from vicgalle/SOLAR-13B-Instruct-v1.0, a SOLAR-like model upscaled to 13B.
+It is a frankenmerge model created using mergekit, alternating layers of Nous-Hermes-2-SOLAR-10.7B and SOLAR-10.7B-Instruct. Then, we applied DPO over a high-quality preference dataset.
+
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/5fad8602b8423e1d80b8a965/rNtaTqTKrAoN5-C5DuPgu.png)