mistral_self_alignment_DPO / adapter_model.safetensors

Commit History

initial adapter with DPO on sample dataset and 1 epoch
36d527d
verified

August4293 commited on