SivilTaram
committed on
Update README.md
README.md
CHANGED
@@ -10,7 +10,7 @@ tags:
 ---
 
 
-# Models Trained with
+# Models Trained with DoReMi Data Mixture
 
 This is a collection of the language models trained using DoReMi data mxiture, each with approximately 1B parameters, trained on different random mixtures of data. This models aims to server as the strong baseline for our RegMix approach (https://huggingface.co/papers/2407.01492).
 