Commit 9e267cf
Parent(s): 6674f66
Update README.md

README.md CHANGED
```diff
@@ -16,19 +16,22 @@ tags:
 
 Model Card for Loquace-12B
 
-# Loquace
+# Loquace 🇮🇹 An Italian instruction finetuned LargeLanguage model. 🇮🇹
 
 ## Model Description
 
-Loquace-12B is
+Loquace-12B is the first 12B italian Large Language Model trained using QLoRa on a large dataset of 102k question/answer pairs
+exclusively in Italian.
 
 ## Usage
 
-## Training Data
 
-
-
-
+
+## Training
+
+Loquace-12B was trained on a conversational dataset comprising 102k question/answer pairs in Italian language.
+The training data was constructed by putting together translations from the original alpaca Dataset and other sources like the OpenAssistant dataset.
+The model was trained for only 3000 iterations and took 18 hours on a single RTX 3090, kindly provided by Genesis Cloud.
 
 ## Limitations
 
```
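The QLoRA setup that the new `## Training` section describes can be sketched with `peft` and `bitsandbytes`. This is only a sketch of the technique: every hyperparameter and the base-model name below are illustrative assumptions, since the commit does not record them.

```python
# Sketch of a QLoRA configuration of the kind the Training section describes.
# All values below are illustrative assumptions, not taken from the commit.
import torch
from transformers import BitsAndBytesConfig
from peft import LoraConfig

# 4-bit NF4 quantization of the frozen base model (the "Q" in QLoRA).
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_use_double_quant=True,
    bnb_4bit_compute_dtype=torch.bfloat16,
)

# Low-rank adapters trained on top of the quantized weights (the "LoRA").
lora_config = LoraConfig(
    r=16,                 # assumption: adapter rank
    lora_alpha=32,        # assumption: scaling factor
    lora_dropout=0.05,
    bias="none",
    task_type="CAUSAL_LM",
)
```

These two objects would then be passed to `AutoModelForCausalLM.from_pretrained(..., quantization_config=bnb_config)` and `get_peft_model(...)` respectively; quantizing the base weights to 4-bit while training only small adapters is what makes an 18-hour run on a single RTX 3090 plausible for a 12B model.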