yang335's picture
End of training
7dc072b verified
{
"epoch": 2.8847926267281108,
"total_flos": 1.6465460086177792e+16,
"train_loss": 0.6967043181260427,
"train_runtime": 4083.1676,
"train_samples_per_second": 0.318,
"train_steps_per_second": 0.013
}