---
base_model:
- wzhouad/gemma-2-9b-it-WPO-HB
- google/gemma-2-9b-it
- princeton-nlp/gemma-2-9b-it-SimPO
library_name: transformers
tags:
- mergekit
- merge

---
# Gemma Advanced V1 (obsolete)

Note: A much-improved version is available at [jsgreenawalt/gemma-2-9B-it-advanced-v2.1](https://huggingface.co/jsgreenawalt/gemma-2-9B-it-advanced-v2.1)

Experimental merge #1, attempting to combine some of the advanced Gemma fine-tunes.

Quants are available here: https://huggingface.co/QuantFactory/gemma-advanced-v1-GGUF

Notes and observations:
* Recommended temperature: 0.15 or lower; the model is more temperature-sensitive than its parent models (see the sampling sketch after this list)
* Recommended quant: Q8_0; Q6 and lower quants lose more quality than expected
* The model writes coherently (at lower temperatures) and has a different writing style than the parent models
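
For reference, a minimal generation sketch with `transformers` that applies the settings above. The repo ID below is an assumption inferred from the quant link; substitute the actual model path if it differs:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "jsgreenawalt/gemma-advanced-v1"  # assumed repo ID, adjust if needed

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",
)

messages = [{"role": "user", "content": "Write a short scene set in a lighthouse."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

# Keep temperature at or below 0.15, per the notes above.
outputs = model.generate(
    inputs,
    max_new_tokens=256,
    do_sample=True,
    temperature=0.15,
)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```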

This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).

## Merge Details
### Merge Method

This model was merged using the della merge method, with [google/gemma-2-9b-it](https://huggingface.co/google/gemma-2-9b-it) as the base.

### Models Merged

The following models were included in the merge:
* [wzhouad/gemma-2-9b-it-WPO-HB](https://huggingface.co/wzhouad/gemma-2-9b-it-WPO-HB)
* [princeton-nlp/gemma-2-9b-it-SimPO](https://huggingface.co/princeton-nlp/gemma-2-9b-it-SimPO)

### Configuration

The following YAML configuration was used to produce this model:

```yaml
models:
  - model: google/gemma-2-9b-it 
    # no parameters necessary for base model
  - model: princeton-nlp/gemma-2-9b-it-SimPO 
    parameters:
      density: 0.5
      weight: 0.5
  - model: wzhouad/gemma-2-9b-it-WPO-HB
    parameters:
      density: 0.5
      weight: 0.5
merge_method: della
base_model: google/gemma-2-9b-it
parameters:
  normalize: true
dtype: float16

```
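
In this configuration, `density` sets the fraction of delta parameters retained from each fine-tune, `weight` scales each model's contribution, and `normalize: true` rescales the combined weights to sum to 1. To reproduce the merge, save the configuration above to a file (e.g. `config.yaml`, name illustrative) and pass it to mergekit's CLI, e.g. `mergekit-yaml config.yaml ./output-model-directory`.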