IIC
/

Text Generation
Transformers
Safetensors
Spanish
qwen2
chat
conversational
text-generation-inference
Inference Endpoints
gonzalo-santamaria-iic commited on
Commit
1e791b6
·
verified ·
1 Parent(s): 3e7c156

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +6 -2
README.md CHANGED
@@ -162,7 +162,7 @@ For a better experience, we recommend using [the following generation parameters
162
 
163
  ### Training Data
164
 
165
- A combination of both public and private datasets designed in the IIC. The dataset consists of 21975 conversations in Spanish, with the format `chatml` and has the same structure as the [Anthropic/hh-rlhf dataset](https://huggingface.co/datasets/Anthropic/hh-rlhf). Each conversation has two variants: `chosen` and `rejected`, and only differs the last answer of the assistant. The last answer in the `chosen` variant is considered a better answer than the one in the `rejected` variant. Different techniques have been used to generate the dataset, which we explain in depth in the paper (**coming soon**).
166
 
167
  ### Training Procedure
168
 
@@ -224,7 +224,11 @@ As can be seen in the time used, in eight and a half hours we have managed to im
224
 
225
  ## Evaluation
226
 
227
- <!-- This section describes the evaluation protocols and provides the results. -->
 
 
 
 
228
 
229
  ### Testing Data, Factors & Metrics
230
 
 
162
 
163
  ### Training Data
164
 
165
+ A combination of both public and private datasets designed in the IIC. The dataset consists of 21975 conversations in Spanish, with the format `chatml` and has the same structure as the [Anthropic/hh-rlhf dataset](https://huggingface.co/datasets/Anthropic/hh-rlhf). Each conversation has two variants: `chosen` and `rejected`, and only differs the last answer of the assistant. The last answer in the `chosen` variant is considered a better answer than the one in the `rejected` variant. Different techniques have been used to generate the dataset, which we explain in depth in the research (**coming soon**).
166
 
167
  ### Training Procedure
168
 
 
224
 
225
  ## Evaluation
226
 
227
+ To evaluate, we use the following datasets:
228
+
229
+ 1. [IIC/AQuAS](https://huggingface.co/datasets/IIC/AQuAS).
230
+ 2. [IIC/RagQuAS](https://huggingface.co/datasets/IIC/RagQuAS).
231
+ 3. privados
232
 
233
  ### Testing Data, Factors & Metrics
234