austindavis
/

gpt2-lichess-uci-2016-01_11

Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

austindavis commited on May 11, 2024

Commit

bb72387

·

verified ·

1 Parent(s): 74a6b58

End of training

Files changed (3) hide show

README.md +12 -10
generation_config.json +6 -3
model.safetensors +1 -1

README.md CHANGED Viewed

@@ -1,24 +1,20 @@
 ---
 tags:
 - generated_from_trainer
-widget:
-- text: e2e4
-  example_title: King's pawn
-- text: d2d4
-  example_title: Queen's pawn
 model-index:
-- name: austindavis/gpt2-pretrained-lichess-uci
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# gpt2-pretrained-lichess-uci-finetuned-lichess-uci
-This model is a Pretrained GPT-2 trained on an the Lichess UCI dataset from Feb 2013.
 It achieves the following results on the evaluation set:
-- Loss: 1.3084
 ## Model description
@@ -37,7 +33,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 0.0002
 - train_batch_size: 16
 - eval_batch_size: 8
 - seed: 42
@@ -45,6 +41,12 @@ The following hyperparameters were used during training:
 - lr_scheduler_type: cosine
 - num_epochs: 1
 ### Framework versions

 ---
+base_model: austindavis/gpt2-lichess-uci-201601
 tags:
 - generated_from_trainer
 model-index:
+- name: gpt2-lichess-uci-2016-01_11
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+# gpt2-lichess-uci-2016-01_11
+This model is a fine-tuned version of [austindavis/gpt2-lichess-uci-201601](https://huggingface.co/austindavis/gpt2-lichess-uci-201601) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.0379
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 0.0001715755714441261
 - train_batch_size: 16
 - eval_batch_size: 8
 - seed: 42
 - lr_scheduler_type: cosine
 - num_epochs: 1
+### Training results
+| Training Loss | Epoch | Step   | Validation Loss |
+|:-------------:|:-----:|:------:|:---------------:|
+| 1.0634        | 1.0   | 266171 | 1.0379          |
 ### Framework versions

generation_config.json CHANGED Viewed

@@ -1,6 +1,9 @@
 {
-  "_from_model_config": true,
-  "bos_token_id": 1,
   "eos_token_id": 2,
   "transformers_version": "4.40.1"
-}

 {
+  "do_sample": true,
   "eos_token_id": 2,
+  "max_length": 128,
+  "max_new_tokens": 128,
+  "pad_token_id": 0,
+  "temperature": 0.0001,
   "transformers_version": "4.40.1"
+}

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d4f7485b1a913fd8cfbf70278c5bfe5ae943d596f79e937950229b97ded75469
 size 102086376

 version https://git-lfs.github.com/spec/v1
+oid sha256:9f610df8ff91108cf4949c4c4057ebebc783fedb8857c07f4ab5b5528d88f681
 size 102086376