Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
MYC081
/
Qwen2.5-3B-WPO-bf16-1
like
0
Text Generation
Transformers
TensorBoard
Safetensors
trl-lib/ultrafeedback-prompt
qwen2
Generated from Trainer
trl
xpo
conversational
text-generation-inference
Inference Endpoints
arxiv:
2405.21046
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Deploy
Use this model
main
Qwen2.5-3B-WPO-bf16-1
Commit History
End of training
accac17
verified
MYC081
commited on
Nov 15, 2024
Model save
329110b
verified
MYC081
commited on
Nov 15, 2024
Training in progress, step 3540
aca4918
verified
MYC081
commited on
Nov 15, 2024
Training in progress, step 3500
782cb7f
verified
MYC081
commited on
Nov 15, 2024
Training in progress, step 3000
bde4a6d
verified
MYC081
commited on
Nov 15, 2024
Training in progress, step 2500
cb0f36e
verified
MYC081
commited on
Nov 15, 2024
Training in progress, step 2000
c52e5e5
verified
MYC081
commited on
Nov 14, 2024
Training in progress, step 1500
936b222
verified
MYC081
commited on
Nov 14, 2024
Training in progress, step 1000
0aea0c8
verified
MYC081
commited on
Nov 14, 2024
Training in progress, step 500
467ff07
verified
MYC081
commited on
Nov 14, 2024
initial commit
5446cc9
verified
MYC081
commited on
Nov 14, 2024