Iñigo López-Riobóo Botana committed on
Commit · 523c7a4
Parent(s): 8e5fab5
Update README.md

README.md CHANGED
@@ -20,8 +20,13 @@ inference: false
 
 ## Description
 
-This is a **transformer-decoder** [GPT-2 model](https://huggingface.co/gpt2), adapted for **single-turn dialogue
-
+This is a **transformer-decoder** [GPT-2 model](https://huggingface.co/gpt2), adapted for the **single-turn dialogue task in Spanish**. We fine-tuned a [DialoGPT-medium](https://huggingface.co/microsoft/DialoGPT-medium) 345M-parameter model from Microsoft, following the CLM (Causal Language Modelling) objective.
+
+---
+
+## Dataset
+
+We used one of the datasets available in the [Bot Framework Tools repository](https://github.com/microsoft/botframework-cli). We processed [the professional-styled personality chat dataset in Spanish](https://github.com/microsoft/botframework-cli/blob/main/packages/qnamaker/docs/chit-chat-dataset.md); the file is available [to download here](https://qnamakerstore.blob.core.windows.net/qnamakerdata/editorial/spanish/qna_chitchat_professional.tsv).
 
 ---
 
@@ -105,4 +110,4 @@ You can check the [original GitHub repository](https://github.com/microsoft/Dial
 > Since our approach can assign a probability to any Unicode string, this allows us to evaluate our LMs on any dataset regardless of pre-processing, tokenization, or vocab size.
 - This model is intended to be used **just for single-turn chitchat conversations in Spanish**.
 - This model's generation capabilities are limited to the extent of the aforementioned fine-tuning dataset.
-- This model generates short answers, providing general context dialogue in a professional style.
+- This model generates short answers, providing general context dialogue in a professional style for the Spanish language.