Update README.md
README.md CHANGED
@@ -16,8 +16,8 @@ This model is a fine-tuned version of [microsoft/phi-2](https://huggingface.co/m
 I think SPIN can be used not only on an SFT model, but also on a pretrained model.
 Therefore, I applied SPIN to the pretrained model microsoft/phi-2 and obtained a higher score than the original pretrained model. You can check the [open llm leaderboard](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard).
 
-
-pretrain -> dpo(spin) -> sft -> dpo(spin)
+**I think the best paradigm for training a conversational Large Language Model (LLM) is:
+pretrain -> dpo(spin) -> sft -> dpo(spin)**
 
 ## Training procedure
 
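
As a rough illustration of the `dpo(spin)` step in the paradigm added above, the sketch below runs one SPIN-style DPO iteration on the pretrained microsoft/phi-2 with the TRL library. It assumes a recent TRL release (where `DPOTrainer` accepts a `DPOConfig` and a `processing_class`); the toy preference pairs, the `beta` value, and the output directory are illustrative placeholders, not the author's actual training setup.

```python
# Minimal sketch of one SPIN-style DPO iteration on a pretrained model.
# Dataset contents, hyperparameters, and paths are illustrative assumptions.
from datasets import Dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import DPOConfig, DPOTrainer

model_name = "microsoft/phi-2"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

# SPIN treats the reference (human) completions as "chosen" and the model's
# own generations as "rejected"; this toy dict stands in for the real
# self-generated preference data.
pairs = {
    "prompt": ["Explain what SPIN is.", "What is DPO?"],
    "chosen": ["SPIN is self-play fine-tuning ...", "DPO is direct preference optimization ..."],
    "rejected": ["<model-generated answer>", "<model-generated answer>"],
}
train_dataset = Dataset.from_dict(pairs)

# DPO on the self-play pairs; beta and batch size are placeholders.
config = DPOConfig(
    output_dir="phi-2-spin-iter0",
    beta=0.1,
    per_device_train_batch_size=1,
)
trainer = DPOTrainer(
    model=model,
    args=config,
    train_dataset=train_dataset,
    processing_class=tokenizer,
)
trainer.train()
```

In the full pipeline this step would be iterated: the "rejected" responses are regenerated from the newly trained model and DPO is run again, before the later sft and second dpo(spin) stages.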