ethzanalytics
/

ai-msgbot-gpt2-XL-dialogue

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

pszemraj commited on Apr 26, 2023

Commit

da20160

•

1 Parent(s): fd349cd

Update README.md

Files changed (1) hide show

README.md +3 -8

README.md CHANGED Viewed

@@ -1,5 +1,4 @@
 ---
 language:
 - en
 tags:
@@ -8,8 +7,7 @@ tags:
 - gpt
 license: mit
 datasets:
-- natural questions
 widget:
 - text: "Do you like my new haircut?\nperson beta:\n\n"
   example_title: "haircut"
@@ -19,7 +17,6 @@ widget:
   example_title: "favorite"
 - text: "how much does it cost?\nperson beta:\n\n"
   example_title: "money"
 inference:
   parameters:
     min_length: 2
@@ -30,12 +27,10 @@ inference:
     top_p: 0.85
     top_k: 10
     repetition_penalty: 2.1
 ---
-# ai-msgbot GPT2-XL-dialogue
-_NOTE: model card is WIP_
 GPT2-XL (~1.5 B parameters) trained on [the Wizard of Wikipedia dataset](https://parl.ai/projects/wizard_of_wikipedia/) for 40k steps with **33**/36 layers frozen using `aitextgen`. The resulting model was then **further fine-tuned** on the [Daily Dialogues](http://yanran.li/dailydialog) for 40k steps, with **34**/36 layers frozen.

 ---
 language:
 - en
 tags:
 - gpt
 license: mit
 datasets:
+- natural_questions
 widget:
 - text: "Do you like my new haircut?\nperson beta:\n\n"
   example_title: "haircut"
   example_title: "favorite"
 - text: "how much does it cost?\nperson beta:\n\n"
   example_title: "money"
 inference:
   parameters:
     min_length: 2
     top_p: 0.85
     top_k: 10
     repetition_penalty: 2.1
 ---
+# ai-msgbot: GPT2-XL-dialogue
 GPT2-XL (~1.5 B parameters) trained on [the Wizard of Wikipedia dataset](https://parl.ai/projects/wizard_of_wikipedia/) for 40k steps with **33**/36 layers frozen using `aitextgen`. The resulting model was then **further fine-tuned** on the [Daily Dialogues](http://yanran.li/dailydialog) for 40k steps, with **34**/36 layers frozen.