SivilTaram
commited on
Update README.md
Browse files
README.md
CHANGED
@@ -19,7 +19,7 @@ This is a collection of the language models trained using Pile-CC, each with app
|
|
19 |
- **Model Size**: 5 separate models trained with different seeds, each with ~1B parameters
|
20 |
- **Training Data**: Human selection (from The Pile paper) data mixtures on the [RegMix-Data](https://huggingface.co/datasets/sail/regmix-data) dataset
|
21 |
- **Purpose**: The Human selection is a strong baseline for our method RegMix
|
22 |
-
|
23 |
## Dataset
|
24 |
|
25 |
The models were trained using the [RegMix-Data](https://huggingface.co/datasets/sail/regmix-data) dataset, which is split into different domains from The Pile dataset.
|
|
|
19 |
- **Model Size**: 5 separate models trained with different seeds, each with ~1B parameters
|
20 |
- **Training Data**: Human selection (from The Pile paper) data mixtures on the [RegMix-Data](https://huggingface.co/datasets/sail/regmix-data) dataset
|
21 |
- **Purpose**: The Human selection is a strong baseline for our method RegMix
|
22 |
+
|
23 |
## Dataset
|
24 |
|
25 |
The models were trained using the [RegMix-Data](https://huggingface.co/datasets/sail/regmix-data) dataset, which is split into different domains from The Pile dataset.
|