freewheelin committed on
Commit 22b9902 · verified · 1 Parent(s): 09bc8ba

Update README.md

  1. README.md +11 -11
README.md CHANGED
@@ -16,17 +16,17 @@ but this kicked away. maybe the explanation was not enough.
  - We were inspired by this [Sakana project](https://sakana.ai/evolutionary-model-merge/)

  ## Process
- - you need two models with the same architecture
- - 1. choose one model and finetune the model to make a gap between the original one and fine-tuned one. it doesn't matter the evaluation score is higher or lower.
- - 2. merge two of them
- - 3. evaluate the merged model
- - 4. finetune a specific evaluation part if you need to increase score of the part of the model. (sure it's not gonna work like you think. but try it)
- - 5. merge again
- - 6. evaluate again
- - 7. keep going until evaluate avg is higher then original one
-
- that's it. simple.
- you can make a framework to do this automatically.
+ You need two models with the same architecture.
+ - Choose one model and fine-tune it to create a gap between the original model and the fine-tuned one. It doesn't matter whether the evaluation score is higher or lower.
+ - Merge the two models.
+ - Evaluate the merged model.
+ - Fine-tune a specific evaluation part of the model if you need to increase the score for that part. (It's unlikely to work as you think, but you can try it.)
+ - Merge the models again.
+ - Evaluate again.
+ - Keep going until the average evaluation score is higher than the original one.
+
+ That's it. Simple.
+ You can create a framework to automate this process.

  ## Base Architecture
  - QWEN2
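
The Process steps in the updated README (fine-tune, merge, evaluate, repeat until the average score beats the original) can be sketched as a small loop. This is only a minimal sketch under stated assumptions, not the author's framework: weights are plain Python lists keyed by parameter name, merging is simple linear interpolation (the README does not name a merge method), each round merges the current candidate with its freshly fine-tuned copy, and `fine_tune` and `evaluate` are hypothetical callables supplied by the caller.

```python
from typing import Callable, Dict, List, Tuple

# Toy stand-in for a model's parameters: name -> flat list of floats.
Weights = Dict[str, List[float]]


def merge_weights(a: Weights, b: Weights, alpha: float = 0.5) -> Weights:
    """Linearly interpolate two weight sets: (1 - alpha) * a + alpha * b.

    Assumption: linear interpolation is used only for illustration.
    Both models must share the same architecture (same names and sizes).
    """
    if a.keys() != b.keys():
        raise ValueError("models must have the same parameter names")
    merged: Weights = {}
    for name in a:
        if len(a[name]) != len(b[name]):
            raise ValueError(f"size mismatch for parameter {name!r}")
        merged[name] = [
            (1 - alpha) * x + alpha * y for x, y in zip(a[name], b[name])
        ]
    return merged


def iterate_merge(
    original: Weights,
    fine_tune: Callable[[Weights], Weights],   # hypothetical: returns a tuned copy
    evaluate: Callable[[Weights], float],      # hypothetical: average eval score
    max_rounds: int = 10,
    alpha: float = 0.5,
) -> Tuple[Weights, float, int]:
    """Repeat fine-tune -> merge -> evaluate until the score beats the original.

    Returns (weights, score, rounds_used). If no round beats the baseline,
    the last merged candidate is returned after max_rounds.
    """
    baseline = evaluate(original)
    current = original
    score = baseline
    for round_idx in range(1, max_rounds + 1):
        tuned = fine_tune(current)                      # create a gap
        current = merge_weights(current, tuned, alpha)  # merge the two
        score = evaluate(current)                       # evaluate the merge
        if score > baseline:                            # stop once it is better
            return current, score, round_idx
    return current, score, max_rounds
```

With a toy `fine_tune` that adds 1.0 to every weight and a toy `evaluate` that scores closeness to a target value, the loop stops as soon as a merged candidate scores above the original. In practice `fine_tune` and `evaluate` would wrap real training and benchmark runs, and `max_rounds` caps the cost when no merge improves on the baseline.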