Iñigo López-Riobóo Botana committed
Commit · 6f72295
1 Parent(s): b27c12e
Update README.md
README.md CHANGED
@@ -9,6 +9,10 @@ tags:
 - gpt
 - gpt2
 - text-generation
+- spanish
+- dialogpt
+- chitchat
+- ITG
 inference: false
 ---
 
@@ -23,7 +27,7 @@ We used one of the datasets available in the [Bot Framework Tools repository](ht
 
 ## Example inference script
 
-### Check at this example script to run
+### Check this example script to run our model in inference mode
 
 ```python
 import torch
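The README's own inference script is truncated by the diff context above (it ends at `import torch`), so here is a minimal single-turn sketch of how a DialoGPT-style checkpoint like this one is typically run with `transformers`. The checkpoint id and the generation parameters are assumptions for illustration, not the README's actual values.

```python
# Minimal single-turn inference sketch (NOT the README's full script, which the
# diff truncates after `import torch`). The checkpoint id and generation
# parameters below are assumptions; substitute the repository's actual values.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

CHECKPOINT = "ITG/DialoGPT-medium-spanish-chitchat"  # assumed model id

tokenizer = AutoTokenizer.from_pretrained(CHECKPOINT)
model = AutoModelForCausalLM.from_pretrained(CHECKPOINT)
model.eval()

# DialoGPT-style models expect each turn to end with the end-of-sequence token.
user_input = "Hola, ¿qué tal ha ido la reunión?"
input_ids = tokenizer.encode(user_input + tokenizer.eos_token, return_tensors="pt")

# Generate a short reply; the sampling settings here are illustrative only.
with torch.no_grad():
    output_ids = model.generate(
        input_ids,
        max_length=128,
        pad_token_id=tokenizer.eos_token_id,
        do_sample=True,
        top_k=50,
        top_p=0.95,
    )

# Decode only the newly generated tokens (the reply), not the prompt.
reply = tokenizer.decode(output_ids[0, input_ids.shape[-1]:], skip_special_tokens=True)
print(reply)
```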
@@ -84,7 +88,7 @@ for i in range(CHAT_TURNS):
 | Warmup training steps (%) | 6% |
 | Weight decay | 0.01 |
 | Optimiser (beta1, beta2, epsilon) | AdamW (0.9, 0.999, 1e-08) |
-| Monitoring metric (delta, patience) |
+| Monitoring metric (delta, patience) | Validation loss (0.1, 3) |
 
 
 ## Fine-tuning in a different dataset or style
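For readers who want to reproduce a comparable setup, the hyperparameters in the table above map roughly onto the Hugging Face `Trainer` API as sketched below. The original fine-tuning code is not shown in this diff, so the argument names and the per-epoch evaluation schedule are assumptions.

```python
# Sketch of how the table's hyperparameters map onto the Hugging Face Trainer API.
# The original fine-tuning code is not part of this diff; argument names and the
# evaluation schedule are assumptions.
from transformers import TrainingArguments, EarlyStoppingCallback

training_args = TrainingArguments(
    output_dir="dialogpt-spanish-chitchat",
    warmup_ratio=0.06,                   # Warmup training steps (%): 6%
    weight_decay=0.01,                   # Weight decay: 0.01
    adam_beta1=0.9,                      # AdamW beta1
    adam_beta2=0.999,                    # AdamW beta2
    adam_epsilon=1e-08,                  # AdamW epsilon
    eval_strategy="epoch",               # `evaluation_strategy` on older transformers versions
    save_strategy="epoch",
    load_best_model_at_end=True,
    metric_for_best_model="eval_loss",   # Monitoring metric: validation loss
    greater_is_better=False,
)

# Early stopping on validation loss with delta 0.1 and patience 3, as in the table.
early_stopping = EarlyStoppingCallback(
    early_stopping_patience=3,
    early_stopping_threshold=0.1,
)
```

Passing `early_stopping` in the `callbacks` list of a `Trainer` built with these arguments would stop training once the validation loss fails to improve by at least 0.1 for three consecutive evaluations.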
@@ -94,6 +98,7 @@ You can check the [original GitHub repository](https://github.com/microsoft/Dial
 
 ## Limitations
 
+- This model uses the original English-based tokenizer from the [GPT-2 paper](https://d4mucfpksywv.cloudfront.net/better-language-models/language_models_are_unsupervised_multitask_learners.pdf). Spanish tokenization was not considered, but Spanish and English share enough grammatical structure for the encoding to work, and this overlap may help the model transfer its knowledge from English to Spanish.
 - This model is intended to be used **just for single-turn chitchat conversations in Spanish**.
 - This model's generation capabilities are limited to the extent of the aforementioned fine-tuning dataset.
 - This model generates short answers, providing general context dialogue in a professional style.
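To make the first limitation in the hunk above concrete, the snippet below (an illustrative sketch, not part of the README) shows the stock English GPT-2 tokenizer encoding a Spanish sentence: it works, but accented characters and Spanish-specific word forms are split into more byte-level BPE pieces than a Spanish-specific tokenizer would produce.

```python
# Illustrative sketch: the English GPT-2 byte-level BPE tokenizer can encode Spanish,
# just less compactly than an in-language tokenizer would.
from transformers import GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")

spanish = "¿Qué tal ha ido la reunión de esta mañana?"
tokens = tokenizer.tokenize(spanish)

print(len(tokens))  # more subword pieces than an equivalent English sentence would need
print(tokens)       # accented characters appear as byte-level fragments
```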