Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
yangzhao02
/
mistral-7b-dpo-all_pairs
like
0
Safetensors
yangzhao02/ListUltraFeedback
mistral
alignment-handbook
ndcg
trl
Generated from Trainer
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
b5c6dab
mistral-7b-dpo-all_pairs
Commit History
End of training
b5c6dab
verified
yangzhao02
commited on
Aug 19, 2024
Model save
2205144
verified
yangzhao02
commited on
Aug 19, 2024
Training in progress, step 467
cda9026
verified
yangzhao02
commited on
Aug 19, 2024
Training in progress, step 375
60955a9
verified
yangzhao02
commited on
Aug 19, 2024
Training in progress, step 250
b39676a
verified
yangzhao02
commited on
Aug 18, 2024
Training in progress, step 125
76223c6
verified
yangzhao02
commited on
Aug 18, 2024
initial commit
454af02
verified
yangzhao02
commited on
Aug 18, 2024