jambroz commited on
Commit
23fb673
·
verified ·
1 Parent(s): a15e16b

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +56 -1
README.md CHANGED
@@ -2,4 +2,59 @@ Hi..
2
 
3
  A pretty decent 7b chat... it probably could use a finetune, however this comes from some decent models smashed together.
4
 
5
- I'm not good at writing descriptions...
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
2
 
3
  A pretty decent 7b chat... it probably could use a finetune, however this comes from some decent models smashed together.
4
 
5
+ I'm not good at writing descriptions...
6
+
7
+ ---
8
+ base_model:
9
+ - Intel/neural-chat-7b-v3-1
10
+ - mlabonne/AlphaMonarch-7B
11
+ - HuggingFaceH4/zephyr-7b-beta
12
+ - mlabonne/NeuralBeagle14-7B
13
+ library_name: transformers
14
+ tags:
15
+ - mergekit
16
+ - merge
17
+ license: apache-2.0
18
+ ---
19
+ # merge
20
+
21
+ This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
22
+
23
+ ## Merge Details
24
+ ### Merge Method
25
+
26
+ This model was merged using the [DARE](https://arxiv.org/abs/2311.03099) [TIES](https://arxiv.org/abs/2306.01708) merge method using [mlabonne/NeuralBeagle14-7B](https://huggingface.co/mlabonne/NeuralBeagle14-7B) as a base.
27
+
28
+ ### Models Merged
29
+
30
+ The following models were included in the merge:
31
+ * [Intel/neural-chat-7b-v3-1](https://huggingface.co/Intel/neural-chat-7b-v3-1)
32
+ * [mlabonne/AlphaMonarch-7B](https://huggingface.co/mlabonne/AlphaMonarch-7B)
33
+ * [HuggingFaceH4/zephyr-7b-beta](https://huggingface.co/HuggingFaceH4/zephyr-7b-beta)
34
+
35
+ ### Configuration
36
+
37
+ The following YAML configuration was used to produce this model:
38
+
39
+ ```yaml
40
+ base_model: mlabonne/NeuralBeagle14-7B
41
+ dtype: bfloat16
42
+ merge_method: dare_ties
43
+ models:
44
+ - model: mlabonne/NeuralBeagle14-7B
45
+ - model: mlabonne/AlphaMonarch-7B
46
+ parameters:
47
+ density: '0.53'
48
+ weight: '0.4'
49
+ - model: Intel/neural-chat-7b-v3-1
50
+ parameters:
51
+ density: '0.53'
52
+ weight: '0.3'
53
+ - model: HuggingFaceH4/zephyr-7b-beta
54
+ parameters:
55
+ density: '0.53'
56
+ weight: '0.3'
57
+ parameters:
58
+ int8_mask: true
59
+
60
+ ```