freewheelin
commited on
Update README.md
Browse files
README.md
CHANGED
@@ -16,17 +16,17 @@ but this kicked away. maybe the explanation was not enough.
|
|
16 |
- We were inspired by this [Sakana project](https://sakana.ai/evolutionary-model-merge/)
|
17 |
|
18 |
## Process
|
19 |
-
|
20 |
-
-
|
21 |
-
-
|
22 |
-
-
|
23 |
-
-
|
24 |
-
-
|
25 |
-
-
|
26 |
-
-
|
27 |
-
|
28 |
-
|
29 |
-
|
30 |
|
31 |
## Base Architecture
|
32 |
- QWEN2
|
|
|
16 |
- We were inspired by this [Sakana project](https://sakana.ai/evolutionary-model-merge/)
|
17 |
|
18 |
## Process
|
19 |
+
You need two models with the same architecture.
|
20 |
+
- Choose one model and fine-tune it to create a gap between the original model and the fine-tuned one. It doesn't matter whether the evaluation score is higher or lower.
|
21 |
+
- Merge the two models.
|
22 |
+
- Evaluate the merged model.
|
23 |
+
- Fine-tune a specific evaluation part of the model if you need to increase the score for that part. (It's unlikely to work as you think, but you can try it.)
|
24 |
+
- Merge the models again.
|
25 |
+
- Evaluate again.
|
26 |
+
- Keep going until the average evaluation score is higher than the original one.
|
27 |
+
|
28 |
+
That's it. Simple.
|
29 |
+
You can create a framework to automate this process.
|
30 |
|
31 |
## Base Architecture
|
32 |
- QWEN2
|