vwen-0.5 / train_results.json
thangvip's picture
push model
69ce9c4
{
"epoch": 0.11,
"train_loss": 1.9877741447859942,
"train_runtime": 107245.5846,
"train_samples": 1156437,
"train_samples_per_second": 10.783,
"train_steps_per_second": 0.674
}