vicgalle commited on
Commit
01e9582
·
verified ·
1 Parent(s): af02157

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +12 -0
README.md CHANGED
@@ -1,3 +1,15 @@
1
  ---
2
  license: apache-2.0
 
 
 
 
 
3
  ---
 
 
 
 
 
 
 
 
1
  ---
2
  license: apache-2.0
3
+ datasets:
4
+ - argilla/distilabel-intel-orca-dpo-pairs
5
+ tags:
6
+ - dpo
7
+ - 13B
8
  ---
9
+
10
+ # solarized-18B-dpo
11
+
12
+ DPO'd from vicgalle/SOLAR-13B-Instruct-v1.0, a SOLAR-like model upscaled to 13B.
13
+ It is a frankenmerge model created using mergekit, alternating layers of Nous-Hermes-2-SOLAR-10.7B and SOLAR-10.7B-Instruct. Then, we applied DPO over a high-quality preference dataset.
14
+
15
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/5fad8602b8423e1d80b8a965/rNtaTqTKrAoN5-C5DuPgu.png)