Update README.md
README.md CHANGED
@@ -10,7 +10,7 @@ tags:
 This is not an instruct fine tune, instead it's an attempt to de-contaminate the model, remove gptslop and refusals. I want model to feel like it was trained on human data, not synthetic one.
 
 
-About 961 steps total, Yi-34B-200K llamafied DPO trained for 1 epoch on rawrr_v2 dataset via unsloth at prompt length of 400 and max length of 700, lr 0.000045 \
+About 961 steps total, Yi-34B-200K llamafied DPO trained for 1 epoch on rawrr_v2 dataset via unsloth qlora at prompt length of 400 and max length of 700, lr 0.000045 \
 Model initialized with max_positional_embeddings of 4096 to not OOM. \
 Training done on RTX 3090 Ti in about 14 hours. \
 Average mem usage was like 23.89 / 23.99 GiB, so very close to OOM at all times. \
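
For context, a minimal sketch of what a DPO run like the one described in the added line could look like with unsloth QLoRA loading plus trl's DPOTrainer (older trl 0.7-style keyword arguments). Only max_seq_length 4096, 1 epoch, lr 0.000045, prompt length 400, and max length 700 come from the README text; the model path, dataset loading, LoRA rank/alpha/target modules, DPO beta, and batch sizes are assumptions for illustration, not the author's actual script.

```python
# Hypothetical reproduction sketch, not the author's script.
from datasets import load_dataset
from transformers import TrainingArguments
from trl import DPOTrainer
from unsloth import FastLanguageModel

# Load the llamafied Yi-34B-200K base in 4-bit (qlora), with context capped
# at 4096 to avoid OOM, as described in the README.
model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="path/to/yi-34b-200k-llamafied",  # placeholder path
    max_seq_length=4096,
    load_in_4bit=True,
)

# Attach LoRA adapters; rank/alpha/targets are illustrative defaults,
# not values stated in the README.
model = FastLanguageModel.get_peft_model(
    model,
    r=16,
    lora_alpha=16,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
)

# rawrr_v2 assumed to be DPO-style pairs with prompt/chosen/rejected columns.
train_dataset = load_dataset("path/to/rawrr_v2", split="train")  # placeholder

trainer = DPOTrainer(
    model=model,
    ref_model=None,  # reuse the policy with adapters disabled as the reference
    args=TrainingArguments(
        num_train_epochs=1,              # 1 epoch, ~961 steps as stated
        learning_rate=4.5e-5,            # lr 0.000045 from the README
        per_device_train_batch_size=1,   # assumption
        gradient_accumulation_steps=16,  # assumption
        output_dir="dpo-rawrr-v2-out",
    ),
    beta=0.1,                # assumption; beta is not given in the README
    train_dataset=train_dataset,
    tokenizer=tokenizer,
    max_prompt_length=400,   # "prompt length of 400"
    max_length=700,          # "max length of 700"
)
trainer.train()
```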