Update README.md
Browse files
README.md
CHANGED
@@ -28,6 +28,8 @@ It is a FLAN-T5-xl model (3B parameters) finetuned on:
|
|
28 |
1. The [Stanford Human Preferences Dataset (SHP)](https://huggingface.co/datasets/stanfordnlp/SHP), which contains aggregate human preferences sourced from 18 different communities on Reddit (e.g., `askculinary`, `legaladvice`, etc.).
|
29 |
2. The helpfulness data in [Anthropic's HH-RLHF](https://huggingface.co/datasets/Anthropic/hh-rlhf) dataset.
|
30 |
|
|
|
|
|
31 |
|
32 |
## Usage
|
33 |
|
|
|
28 |
1. The [Stanford Human Preferences Dataset (SHP)](https://huggingface.co/datasets/stanfordnlp/SHP), which contains aggregate human preferences sourced from 18 different communities on Reddit (e.g., `askculinary`, `legaladvice`, etc.).
|
29 |
2. The helpfulness data in [Anthropic's HH-RLHF](https://huggingface.co/datasets/Anthropic/hh-rlhf) dataset.
|
30 |
|
31 |
+
There is a smaller variant called [SteamSHP-Large](https://huggingface.co/kawine/SteamSHP-flan-t5-large) that was made by finetuning FLAN-T5-large (780M parameters), which is 0.75 percentage points less accurate on the test data.
|
32 |
+
|
33 |
|
34 |
## Usage
|
35 |
|