Llama-2-7b-sft-DPO-2k / last-checkpoint
AmberYifan
Training in progress, epoch 3, checkpoint
f9dcd3f verified