JayHyeon/Qwen_math-MDPO_0.5_5e-7-1ep_0alp_0lam
Feature Extraction
Transformers
Safetensors
openbmb/UltraInteract_pair
qwen2
Generated from Trainer
trl
dpo
text-generation-inference
text-embeddings-inference
Inference Endpoints
arxiv:2305.18290
main
Commit History
Upload model
1084ab5
verified
JayHyeon
committed on
22 days ago
End of training
942b90c
verified
JayHyeon
committed on
22 days ago
Model save
8e880c1
verified
JayHyeon
committed on
22 days ago
Training in progress, step 3430
ed0c5f3
verified
JayHyeon
committed on
22 days ago
Training in progress, step 3000
cfef518
verified
JayHyeon
committed on
22 days ago
Training in progress, step 2500
cb897b8
verified
JayHyeon
committed on
22 days ago
Training in progress, step 2000
99ddced
verified
JayHyeon
committed on
22 days ago
Training in progress, step 1500
cd0c531
verified
JayHyeon
committed on
22 days ago
Training in progress, step 1000
a039d56
verified
JayHyeon
committed on
22 days ago
Training in progress, step 500
e5cf29c
verified
JayHyeon
committed on
22 days ago
Training in progress, step 2500
a1eb960
verified
JayHyeon
committed on
23 days ago
Training in progress, step 2000
7d843e1
verified
JayHyeon
committed on
23 days ago
Training in progress, step 1500
43e6fc9
verified
JayHyeon
committed on
23 days ago
Training in progress, step 1000
f7b0685
verified
JayHyeon
committed on
23 days ago
Training in progress, step 500
d4df281
verified
JayHyeon
committed on
23 days ago
initial commit
704c41f
verified
JayHyeon
committed on
24 days ago