XueyingJia / qwen-0.5b-sft-HH-online-dpo-ground-truth-lead

Generated from Trainer

qwen-0.5b-sft-HH-online-dpo-ground-truth-lead / runs /Dec10_17-26-47_babel-0-23

1 contributor

History: 24 commits

Training in progress, step 2699

02d718e verified about 2 months ago

events.out.tfevents.1733869613.babel-0-23.2782862.0 (134 kB, LFS)