Display running average of metrics during rollout generation + scoring

Open mikasenghaas opened this issue 3 months ago • 4 comments

Would be cool to show a running average of metrics during generation + scoring. The most basic ones I can think of are reward and seq_len, shown in the tqdm bar description and updated as rollouts complete.

mikasenghaas, Oct 03 '25 21:10

@mikasenghaas I started experimenting with this idea. By seq_len, do you mean the completion length?

anakin87, Oct 05 '25 11:10

probably completion len yea

mikasenghaas, Oct 05 '25 12:10

I'll open a PR soon (in the next few days)

anakin87, Oct 06 '25 07:10

@mikasenghaas is this still relevant? I opened #443, but it will require more work after the eval refactoring. LMK and I'll adapt my PR.

anakin87, Oct 18 '25 15:10
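
For readers following the thread, here is a rough sketch of the idea requested above: keep a running mean of reward and completion length, and refresh the tqdm bar description each time a rollout finishes. It is not the code from #443 and does not use the verifiers API. `RunningMean`, `describe`, `fake_rollout`, and the `completion_len` key are all hypothetical names made up for illustration; only the tqdm calls (`set_description`, `update`) and `asyncio.as_completed` are real.

```python
import asyncio
import random
from dataclasses import dataclass


@dataclass
class RunningMean:
    """Incrementally updated mean, so no per-rollout history is stored."""

    count: int = 0
    mean: float = 0.0

    def update(self, value: float) -> float:
        self.count += 1
        # Incremental mean: new_mean = old_mean + (x - old_mean) / n
        self.mean += (value - self.mean) / self.count
        return self.mean


def describe(metrics: dict[str, RunningMean]) -> str:
    """Format the current averages as a short progress-bar description."""
    return ", ".join(f"{name}={m.mean:.3f}" for name, m in metrics.items())


async def fake_rollout(i: int) -> dict[str, float]:
    """Hypothetical stand-in for one generate+score call."""
    await asyncio.sleep(random.random() * 0.01)
    return {"reward": random.random(), "completion_len": random.randint(10, 200)}


async def run(n: int = 20) -> dict[str, RunningMean]:
    from tqdm import tqdm  # third-party; imported lazily for this sketch

    metrics = {"reward": RunningMean(), "completion_len": RunningMean()}
    tasks = [asyncio.create_task(fake_rollout(i)) for i in range(n)]
    with tqdm(total=n) as pbar:
        # as_completed yields results in finish order, not submission order
        for finished in asyncio.as_completed(tasks):
            result = await finished
            for name, m in metrics.items():
                m.update(result[name])
            pbar.set_description(describe(metrics))
            pbar.update(1)
    return metrics
```

Calling `asyncio.run(run())` would drive the demo. `pbar.set_postfix(...)` is an alternative to `set_description` if the averages should appear after the bar rather than before it.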