XueyingJia
/
qwen-0.5b-sft-HH-online-dpo-ground-truth-lead
Transformers
TensorBoard
Safetensors
XueyingJia/hh-rlhf-train
Generated from Trainer
trl
online-dpo
Inference Endpoints
arxiv:2402.04792
main
qwen-0.5b-sft-HH-online-dpo-ground-truth-lead
/
runs
Commit History
Training in progress, step 2699
02d718e
verified
XueyingJia
committed on
Dec 11, 2024
Training in progress, step 2600
0f62e78
verified
XueyingJia
committed on
Dec 11, 2024
Training in progress, step 2500
08baf3c
verified
XueyingJia
committed on
Dec 11, 2024
Training in progress, step 2400
36aa20e
verified
XueyingJia
committed on
Dec 11, 2024
Training in progress, step 2300
062148d
verified
XueyingJia
committed on
Dec 11, 2024
Training in progress, step 2200
ff63b67
verified
XueyingJia
committed on
Dec 11, 2024
Training in progress, step 2100
4e2f5ca
verified
XueyingJia
committed on
Dec 11, 2024
Training in progress, step 2000
da3ea19
verified
XueyingJia
committed on
Dec 11, 2024
Training in progress, step 1900
fc2437f
verified
XueyingJia
committed on
Dec 11, 2024
Training in progress, step 1800
faad782
verified
XueyingJia
committed on
Dec 11, 2024
Training in progress, step 1700
0881deb
verified
XueyingJia
committed on
Dec 11, 2024
Training in progress, step 1600
1539c32
verified
XueyingJia
committed on
Dec 11, 2024
Training in progress, step 1500
b684d78
verified
XueyingJia
committed on
Dec 11, 2024
Training in progress, step 1400
753c0d4
verified
XueyingJia
committed on
Dec 11, 2024
Training in progress, step 1300
1b32f71
verified
XueyingJia
committed on
Dec 11, 2024
Training in progress, step 1200
8e0e35a
verified
XueyingJia
committed on
Dec 11, 2024
Training in progress, step 1100
f784911
verified
XueyingJia
committed on
Dec 11, 2024
Training in progress, step 1000
cf4deb1
verified
XueyingJia
committed on
Dec 11, 2024
Training in progress, step 900
e326718
verified
XueyingJia
committed on
Dec 11, 2024
Training in progress, step 800
8731207
verified
XueyingJia
committed on
Dec 11, 2024
Training in progress, step 700
028b86d
verified
XueyingJia
committed on
Dec 10, 2024
Training in progress, step 600
bf214d6
verified
XueyingJia
committed on
Dec 10, 2024
Training in progress, step 500
02f9b88
verified
XueyingJia
committed on
Dec 10, 2024
Training in progress, step 400
f2619d6
verified
XueyingJia
committed on
Dec 10, 2024
Training in progress, step 300
215b313
verified
XueyingJia
committed on
Dec 10, 2024
Training in progress, step 200
9c5c90c
verified
XueyingJia
committed on
Dec 10, 2024
Training in progress, step 100
0e62120
verified
XueyingJia
committed on
Dec 10, 2024