Next-DPO-iter1 / checkpoint-3000 /trainer_state.json

Commit History

upload DPO-reproduce checkpoint
f974a11

Wang-Xiaodong1899 commited on