aashish1904 committed on
Commit 00492e3 · verified · 1 Parent(s): 2877d09

Upload README.md with huggingface_hub

Files changed (1)
  1. README.md +60 -0
README.md ADDED
@@ -0,0 +1,60 @@
+
+ ---
+
+ base_model:
+ - flammenai/Mahou-1.3-mistral-nemo-12B
+ - nbeerbower/mistral-nemo-gutenberg-12B-v3
+ library_name: transformers
+ tags:
+ - mergekit
+ - merge
+
+
+ ---
+
+ ![](https://lh7-rt.googleusercontent.com/docsz/AD_4nXeiuCm7c8lEwEJuRey9kiVZsRn2W-b4pWlu3-X534V3YmVuVc2ZL-NXg2RkzSOOS2JXGHutDuyyNAUtdJI65jGTo8jT9Y99tMi4H4MqL44Uc5QKG77B0d6-JfIkZHFaUA71-RtjyYZWVIhqsNZcx8-OMaA?key=xt3VSDoCbmTY7o-cwwOFwQ)
+
+ # QuantFactory/Mahou-Gutenberg-Nemo-12B-GGUF
+ This is a quantized version of [nbeerbower/Mahou-Gutenberg-Nemo-12B](https://huggingface.co/nbeerbower/Mahou-Gutenberg-Nemo-12B), created using llama.cpp.
+
+ # Original Model Card
+
+ # Mahou-Gutenberg-Nemo-12B
+
+ This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
+
+ ## Merge Details
+ ### Merge Method
+
+ This model was merged using the della_linear merge method, with [flammenai/Mahou-1.3-mistral-nemo-12B](https://huggingface.co/flammenai/Mahou-1.3-mistral-nemo-12B) as the base.
+
+ ### Models Merged
+
+ The following models were included in the merge:
+ * [nbeerbower/mistral-nemo-gutenberg-12B-v3](https://huggingface.co/nbeerbower/mistral-nemo-gutenberg-12B-v3)
+
+ ### Configuration
+
+ The following YAML configuration was used to produce this model:
+
+ ```yaml
+ models:
+   - model: flammenai/Mahou-1.3-mistral-nemo-12B
+     parameters:
+       weight: 0.5
+       density: 0.8
+   - model: nbeerbower/mistral-nemo-gutenberg-12B-v3
+     parameters:
+       weight: 0.5
+       density: 0.8
+ merge_method: della_linear
+ base_model: flammenai/Mahou-1.3-mistral-nemo-12B
+ parameters:
+   epsilon: 0.05
+   lambda: 1
+   int8_mask: true
+ dtype: bfloat16
+ tokenizer_source: union
+
+ ```
+
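To make the roles of `weight`, `density`, `epsilon`, and `lambda` in the configuration above more concrete, here is a minimal sketch of a DELLA-style linear merge on flat lists of numbers. It is an illustration under simplifying assumptions, not mergekit's implementation: the function names `della_drop` and `della_linear_merge` are hypothetical, ranking is done over the whole vector, and details such as weight normalization and `int8_mask` are ignored.

```python
import random


def della_drop(delta, density, epsilon, rng):
    """Magnitude-aware stochastic drop-and-rescale of one task vector.

    Hypothetical helper for illustration only. `delta` is a flat list of
    (fine-tuned - base) differences.
    """
    n = len(delta)
    drop = 1.0 - density  # average drop probability
    # Rank entries by magnitude: rank 0 is the smallest, rank n-1 the largest.
    order = sorted(range(n), key=lambda i: abs(delta[i]))
    out = [0.0] * n
    for rank, i in enumerate(order):
        # Spread drop probabilities across a window of width epsilon so that
        # small-magnitude entries are dropped slightly more often.
        frac = rank / (n - 1) if n > 1 else 0.5
        p = drop + epsilon / 2 - epsilon * frac
        p = min(max(p, 0.0), 1.0 - 1e-9)
        if rng.random() >= p:
            # Rescale survivors so the expected value of each entry is kept.
            out[i] = delta[i] / (1.0 - p)
    return out


def della_linear_merge(base, models, weights, density, epsilon, lam, seed=0):
    """Add the weighted sum of pruned task vectors, scaled by lam, to the base."""
    rng = random.Random(seed)
    merged_delta = [0.0] * len(base)
    for params, w in zip(models, weights):
        delta = [p - b for p, b in zip(params, base)]
        pruned = della_drop(delta, density, epsilon, rng)
        merged_delta = [m + w * d for m, d in zip(merged_delta, pruned)]
    return [b + lam * d for b, d in zip(base, merged_delta)]


# Toy stand-ins for the two checkpoints; the numeric settings mirror the YAML.
base = [0.0] * 8
other = [0.3, -1.2, 0.05, 0.9, -0.4, 2.0, -0.01, 0.6]
merged = della_linear_merge(
    base, [base, other], weights=[0.5, 0.5],
    density=0.8, epsilon=0.05, lam=1.0,
)
```

In this sketch the model that is also the base contributes an all-zero task vector, so only the differences introduced by the second model are pruned, rescaled, and added back.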