gonzalo-santamaria-iic
commited on
Update README.md
Browse files
README.md
CHANGED
@@ -162,7 +162,7 @@ For a better experience, we recommend using [the following generation parameters
|
|
162 |
|
163 |
### Training Data
|
164 |
|
165 |
-
A combination of both public and private datasets designed in the IIC. The dataset consists of 21975 conversations in Spanish, with the format `chatml` and has the same structure as the [Anthropic/hh-rlhf dataset](https://huggingface.co/datasets/Anthropic/hh-rlhf). Each conversation has two variants: `chosen` and `rejected`, and only differs the last answer of the assistant. The last answer in the `chosen` variant is considered a better answer than the one in the `rejected` variant. Different techniques have been used to generate the dataset, which we explain in depth in the
|
166 |
|
167 |
### Training Procedure
|
168 |
|
@@ -224,7 +224,11 @@ As can be seen in the time used, in eight and a half hours we have managed to im
|
|
224 |
|
225 |
## Evaluation
|
226 |
|
227 |
-
|
|
|
|
|
|
|
|
|
228 |
|
229 |
### Testing Data, Factors & Metrics
|
230 |
|
|
|
162 |
|
163 |
### Training Data
|
164 |
|
165 |
+
A combination of both public and private datasets designed in the IIC. The dataset consists of 21975 conversations in Spanish, with the format `chatml` and has the same structure as the [Anthropic/hh-rlhf dataset](https://huggingface.co/datasets/Anthropic/hh-rlhf). Each conversation has two variants: `chosen` and `rejected`, and only differs the last answer of the assistant. The last answer in the `chosen` variant is considered a better answer than the one in the `rejected` variant. Different techniques have been used to generate the dataset, which we explain in depth in the research (**coming soon**).
|
166 |
|
167 |
### Training Procedure
|
168 |
|
|
|
224 |
|
225 |
## Evaluation
|
226 |
|
227 |
+
To evaluate, we use the following datasets:
|
228 |
+
|
229 |
+
1. [IIC/AQuAS](https://huggingface.co/datasets/IIC/AQuAS).
|
230 |
+
2. [IIC/RagQuAS](https://huggingface.co/datasets/IIC/RagQuAS).
|
231 |
+
3. privados
|
232 |
|
233 |
### Testing Data, Factors & Metrics
|
234 |
|