Update README.md
README.md (changed)
```diff
@@ -29,11 +29,11 @@ We will release more details in the upcoming technical report.
 - **Model type:** A 7B parameter model fine-tuned on a mix of publicly available, synthetic datasets on translation-related tasks, as well as conversational datasets and code instructions.
 - **Language(s) (NLP):** English, Portuguese, Spanish, French, German, Dutch, Italian, Korean, Chinese, Russian
 - **License:** CC-BY-NC-4.0
-- **Finetuned from model:** TowerBase
+- **Finetuned from model:** [TowerBase](https://huggingface.co/Unbabel/TowerBase-7B-v0.1)
 
 ## Intended uses & limitations
 
-The model was initially fine-tuned on a filtered and preprocessed supervised fine-tuning dataset (TowerBlocks), which contains a diverse range of data sources:
+The model was initially fine-tuned on a filtered and preprocessed supervised fine-tuning dataset ([TowerBlocks](https://huggingface.co/datasets/Unbabel/TowerBlocks-v0.1)), which contains a diverse range of data sources:
 - Translation
 - Automatic Post Edition
 - Machine Translation Evaluation
```
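The changed line above points the supervised fine-tuning data at the TowerBlocks dataset on the Hugging Face Hub (its list of data sources continues in the next hunk). As a minimal sketch of fetching that dataset, assuming the standard 🤗 Datasets API and the repository id taken from the link; the split name and record layout are assumptions, not facts from this diff:

```python
# Minimal sketch: load the TowerBlocks SFT data referenced in the hunk above.
# Assumes `pip install datasets`; the "train" split name and the record layout
# are assumptions, so inspect the printed output before relying on them.
from datasets import load_dataset

blocks = load_dataset("Unbabel/TowerBlocks-v0.1", split="train")
print(blocks)      # row count and column names
print(blocks[0])   # one supervised fine-tuning record
```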
```diff
@@ -45,7 +45,7 @@ The model was initially fine-tuned on a filtered and preprocessed supervised fin
 - Synthetic Chat data
 - Code instructions
 
-You can find the dataset and all data sources of TowerBlocks here.
+You can find the dataset and all data sources of [TowerBlocks](https://huggingface.co/datasets/Unbabel/TowerBlocks-v0.1) here.
 
 Here's how you can run the model using the `pipeline()` function from 🤗 Transformers:
 
```
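The context line that closes the hunk above introduces a `pipeline()` snippet that the diff itself does not show. A sketch of what such a snippet could look like for this model, assuming a recent 🤗 Transformers release with chat-template support and the hub id `Unbabel/TowerInstruct-7B-v0.1`; the translation prompt is purely illustrative:

```python
# Sketch: run TowerInstruct through the 🤗 Transformers pipeline API.
# Assumes transformers >= 4.34 (for chat templates) and the hub id
# Unbabel/TowerInstruct-7B-v0.1; the prompt below is only an illustration.
import torch
from transformers import pipeline

pipe = pipeline(
    "text-generation",
    model="Unbabel/TowerInstruct-7B-v0.1",
    torch_dtype=torch.bfloat16,
    device_map="auto",
)
messages = [
    {
        "role": "user",
        "content": (
            "Translate the following text from Portuguese into English.\n"
            "Portuguese: Um grupo de investigadores lançou um novo modelo "
            "para tarefas relacionadas com tradução.\n"
            "English:"
        ),
    },
]
# Render the message list with the tokenizer's chat template (ChatML here).
prompt = pipe.tokenizer.apply_chat_template(
    messages, tokenize=False, add_generation_prompt=True
)
outputs = pipe(prompt, max_new_tokens=256, do_sample=False)
print(outputs[0]["generated_text"])
```

Greedy decoding (`do_sample=False`) is a sensible default for translation-style tasks, where sampling mostly adds variance rather than quality.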
```diff
@@ -95,13 +95,13 @@ TowerInstruct-v0.1 was trained using the ChatML prompt templates without any sys
 
 ### Supervised tasks
 
-The prompts for all supervised tasks can be found in TowerBlocks. We have used multiple prompt templates for each task. While different prompts may offer different outputs, the difference in downstream performance should be very minimal.
+The prompts for all supervised tasks can be found in [TowerBlocks](https://huggingface.co/datasets/Unbabel/TowerBlocks-v0.1). We have used multiple prompt templates for each task. While different prompts may offer different outputs, the difference in downstream performance should be very minimal.
 
 ## Training Details
 
 ### Training Data
 
-Link to TowerBlocks.
+Link to [TowerBlocks](https://huggingface.co/datasets/Unbabel/TowerBlocks-v0.1).
 
 #### Training Hyperparameters
 
```
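The header of the hunk above references the ChatML prompt template, used here without a system prompt. For reference, a ChatML exchange has the following shape, with the placeholders standing in for the user prompt and the model response:

```
<|im_start|>user
{USER PROMPT}<|im_end|>
<|im_start|>assistant
{MODEL RESPONSE}<|im_end|>
```

The `apply_chat_template` call in the earlier pipeline sketch is what renders a plain message list into this layout.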