jed351/gpt2_tiny_zh-hk-shikoto

Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints


jed351 committed on Jan 28, 2023

Commit 006bf74 · 1 Parent(s): 86a5699

Update README.md

Files changed (1)

README.md +10 -11


@@ -25,24 +25,23 @@ should probably proofread and complete it, then remove this comment. -->
 # gpt2-shikoto
-This model is a fine-tuned version of [jed351/gpt2-tiny-zh-hk](https://huggingface.co/jed351/gpt2-tiny-zh-hk) on the jed351/shikoto_zh_hk dataset.
-It achieves the following results on the evaluation set:
-- Loss: 3.2965
-- Accuracy: 0.3738
-## Model description
-More information needed
-## Intended uses & limitations
-More information needed
-## Training and evaluation data
-More information needed
-## Training procedure
 ### Training hyperparameters

 # gpt2-shikoto
+This model was trained on a dataset I obtained from an online novel site.
+**Please be aware that the stories might contain inappropriate content.**
+The [base model](https://huggingface.co/jed351/gpt2-tiny-zh-hk) was obtained by
+patching a Chinese GPT-2 model and its tokenizer with Cantonese characters.
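
The card itself does not include a usage snippet. As a rough sketch, the fine-tuned checkpoint could be loaded with the `transformers` text-generation pipeline as below. The repository id is taken from the page header above and is assumed to be loadable as-is; the helper names and the sampling defaults (`max_new_tokens`, `temperature`) are illustrative assumptions, not settings from the card.

```python
# Repository id as shown in the page header (assumed to be the loadable id).
MODEL_ID = "jed351/gpt2_tiny_zh-hk-shikoto"


def generation_kwargs(max_new_tokens=50, temperature=0.9):
    """Return sampling settings; these defaults are illustrative only."""
    return {
        "max_new_tokens": max_new_tokens,
        "do_sample": True,
        "temperature": temperature,
    }


def generate(prompt, **overrides):
    """Generate a continuation of `prompt` with the fine-tuned model."""
    # Imported lazily so the helper above can be used without transformers.
    from transformers import pipeline

    generator = pipeline("text-generation", model=MODEL_ID)
    settings = {**generation_kwargs(), **overrides}
    outputs = generator(prompt, **settings)
    # The pipeline returns a list of dicts, one per generated sequence.
    return outputs[0]["generated_text"]


# Example call (downloads the model on first use):
# print(generate("今日天氣好好，", max_new_tokens=30))
```

Because the training data comes from online novels, generated text may contain the same kind of inappropriate content noted above.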
+## Training procedure
+Please refer to the [script](https://github.com/huggingface/transformers/tree/main/examples/pytorch/language-modeling)
+provided by Hugging Face.
+The model was trained for 400,000 steps on two NVIDIA Quadro RTX 6000 GPUs for around 15 hours.
 ### Training hyperparameters