Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
attention-avengers
's Collections
DPO Training
DPO Training
updated
Jun 12, 2024
It contains Qwen1.5-0.5B-Chat version that has been retrained using EPFL data and ...
Upvote
-
attention-avengers/Qwen1.5-0.5B-Chat-EPFL-ORCA-DPO
Text Generation
•
Updated
May 30, 2024
•
1
attention-avengers/Qwen1.5-0.5B-Chat-ORCA-EPFL-cDPO
Text Generation
•
Updated
May 30, 2024
•
3
attention-avengers/Qwen1.5-0.5B-Chat-EPFL-ORCA-cDPO
Text Generation
•
Updated
May 29, 2024
•
6
attention-avengers/Qwen1.5-0.5B-Chat-EPFL-cDPO
Text Generation
•
Updated
May 28, 2024
•
2
attention-avengers/Qwen1.5-0.5B-Chat-ORCA-cDPO
Text Generation
•
Updated
May 29, 2024
•
1
Upvote
-
Share collection
View history
Collection guide
Browse collections