Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Xiaodong
/
Next-DPO-iter1
like
0
Safetensors
Xiaodong/DPO_sdf_17k
Model card
Files
Files and versions
Community
4a80429
Next-DPO-iter1
2 contributors
History:
3 commits
Xiaodong
Create README.md
4a80429
verified
4 months ago
checkpoint-3000
upload DPO-reproduce checkpoint
4 months ago
.gitattributes
1.52 kB
initial commit
4 months ago
README.md
83 Bytes
Create README.md
4 months ago