metadata
language: ta
datasets:
- oscar
- IndicNLP
- Wiki-Tamil novels scraped data
widget:
- text: ஆதித்த கரிகாலர் தஞ்சைக்குச் செல்ல உடனடியாக ஒப்புக்கொண்டார்.
- text: 'நந்தினி பெரிய பழுவேட்டரையரை உண்மையாக நேசித்தால் '
- text: 'மதுராந்தகருக்கு இராஜ்யமாளும் விருப்பம் இருப்பதாக இல்லை '
GPT2-Kalki
Model description
GPT2-Kalki is a GPT-2 transformer model fine-tuned on a corpus of Tamil-language text from Wikipedia, specifically the works of Kalki Krishnamurthy, a Tamil writer from the 1900s. The model is an experiment in generating "what if" scenarios with the characters of his novels. The recently released film Ponniyin Selvan - I is based on a novel by the same author. Fine-tuning started from GPT2-Tamil, a GPT-2 model already trained on Tamil data.
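A "what if" prompt can be tried with the Hugging Face `transformers` text-generation pipeline. The following is a minimal sketch, not an official usage recipe: the repository id `your-username/gpt2-kalki` is a placeholder (this card does not state the actual id), and the sampling settings are illustrative assumptions rather than values recommended by the author. The prompts are the widget examples from this card.

```python
from typing import Any, Dict, List

# Hypothetical Hub repository id (assumption); replace with this model's real id.
MODEL_ID = "your-username/gpt2-kalki"

# Open-ended prompts copied from the widget examples in this model card.
PROMPTS: List[str] = [
    "நந்தினி பெரிய பழுவேட்டரையரை உண்மையாக நேசித்தால் ",
    "மதுராந்தகருக்கு இராஜ்யமாளும் விருப்பம் இருப்பதாக இல்லை ",
]


def generation_kwargs(
    max_new_tokens: int = 60,
    temperature: float = 0.9,
    top_p: float = 0.95,
) -> Dict[str, Any]:
    """Return sampling settings for generation (illustrative defaults, not tuned)."""
    return {
        "max_new_tokens": max_new_tokens,
        "do_sample": True,  # sample instead of greedy decoding for varied stories
        "temperature": temperature,
        "top_p": top_p,
        "num_return_sequences": 1,
    }


def continue_story(prompt: str, model_id: str = MODEL_ID, **overrides: Any) -> str:
    """Generate one continuation of `prompt` with the given model."""
    # Imported lazily so the helpers above work without transformers installed.
    from transformers import pipeline

    generator = pipeline("text-generation", model=model_id)
    kwargs = generation_kwargs()
    kwargs.update(overrides)
    outputs = generator(prompt, **kwargs)
    # The pipeline returns a list of dicts; the text includes the prompt itself.
    return outputs[0]["generated_text"]


if __name__ == "__main__":
    for prompt in PROMPTS:
        print(continue_story(prompt))
        print("-" * 40)
```

Because sampling is enabled, each run produces a different continuation. Lowering `temperature` should make the output more conservative.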
Dataset Used:
The GPT-2 model is trained on the Tamil (ta) portions of the OSCAR and IndicNLP datasets, together with a manually scraped Wikipedia dataset consisting specifically of stories and novels. The scraped dataset will be released soon.