TeeZee commited on
Commit
81359e6
·
verified ·
1 Parent(s): 2a2a8f7

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +10 -0
README.md CHANGED
@@ -3,3 +3,13 @@ license: other
3
  license_name: microsoft
4
  license_link: https://huggingface.co/microsoft/Orca-2-13b/blob/main/LICENSE
5
  ---
 
 
 
 
 
 
 
 
 
 
 
3
  license_name: microsoft
4
  license_link: https://huggingface.co/microsoft/Orca-2-13b/blob/main/LICENSE
5
  ---
6
+
7
+ Lost of good models use Orca for their merges, however vanilla Orca has vocabulary size of 32003,
8
+ 3 last tokens are ChatML tokens and a PAD token. This causes errors during a merge with models with standard 32000 vocabulary size.
9
+
10
+ I've removed those tokens from volabulary and resized model embeddings to mach 32000 standard size. So this model is ready to be used
11
+ as a merge component in mergekit. It may not work on its own with ChatML template anymore.
12
+
13
+ ```python
14
+ model.resize_token_embeddings(32000)
15
+ ```