andreaskoepf committed
Commit 934bda9 · 1 parent: 234fb92

Update README.md

README.md CHANGED
@@ -1,6 +1,13 @@
 ---
 license: apache-2.0
 ---
+
+**Note: internal model, not ready for use**
+
+This is an intermediate model used as base-model for further pythia 12b SFT-8 experiments.
+It was trained on a wider set of instruction-tuning datasets for >12.5k steps with batch-size 128 and a context size of 2048.
+
+
 - wandb: https://wandb.ai/open-assistant/supervised-finetuning/runs/sytsyhrp
 - [sampling report](https://open-assistant.github.io/oasst-model-eval/?f=https%3A%2F%2Fraw.githubusercontent.com%2FOpen-Assistant%2Foasst-model-eval%2Fmain%2Fsampling_reports%2Foasst-pretrained%2F2023-05-05_OpenAssistant_pythia-12b-pre-v8-12_5k-steps_sampling_noprefix2.json)
 
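
For a sense of scale, the training figures quoted in the added README text can be multiplied out. This is only a rough sketch: it takes exactly 12,500 steps (the README says ">12.5k"), and it assumes that the batch size counts sequences per step and that every sequence is filled to the full 2048-token context. Neither assumption is confirmed by the commit, so the result is an order-of-magnitude estimate, not a reported number.

```python
# Figures quoted in the README text of this commit.
steps = 12_500        # the README says ">12.5k steps"; exactly 12.5k is assumed here
batch_size = 128      # assumed to mean sequences per training step
context_size = 2048   # maximum tokens per sequence

# Sequences processed over the run under the assumptions above.
sequences_seen = steps * batch_size

# Token positions processed if every sequence filled the whole context window
# (an assumption; shorter or padded samples would lower the real count).
max_tokens_seen = sequences_seen * context_size

print(f"sequences: {sequences_seen:,}")
print(f"token positions: {max_tokens_seen:,}")
```

Under these assumptions the run covers about 1.6 million sequences and roughly 3.3 billion token positions.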