verifiers

Difficulty filtering

Open faresobeid opened this issue 3 months ago • 0 comments

It would be useful if vf_eval.make_dataset supported an extra average_accuracy column per prompt (averaged over rollouts_per_example). That would make difficulty filtering very easy.
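
To illustrate the request, here is a rough sketch of the aggregation in plain Python. It does not use the verifiers API: the row shape (a "prompt" key and a 0/1 "correct" key per rollout) and the helper names add_average_accuracy and filter_by_difficulty are hypothetical, chosen only to show the idea.

```python
from collections import defaultdict


def add_average_accuracy(rollouts):
    """Collapse per-rollout rows into one row per prompt.

    Assumes (hypothetically) that each row is a dict with a "prompt" key
    and a "correct" key holding 0/1 or a bool for that rollout.
    """
    totals = defaultdict(lambda: [0.0, 0])  # prompt -> [sum of scores, rollout count]
    for row in rollouts:
        acc = totals[row["prompt"]]
        acc[0] += float(row["correct"])
        acc[1] += 1
    return [
        {"prompt": prompt, "average_accuracy": total / count, "num_rollouts": count}
        for prompt, (total, count) in totals.items()
    ]


def filter_by_difficulty(rows, min_acc=0.0, max_acc=1.0):
    """Keep prompts whose average accuracy lies in [min_acc, max_acc]."""
    return [r for r in rows if min_acc <= r["average_accuracy"] <= max_acc]


# Made-up example: three prompts, two rollouts each.
rollouts = [
    {"prompt": "easy", "correct": 1},
    {"prompt": "easy", "correct": 1},
    {"prompt": "medium", "correct": 0},
    {"prompt": "medium", "correct": 1},
    {"prompt": "hard", "correct": 0},
    {"prompt": "hard", "correct": 0},
]

per_prompt = add_average_accuracy(rollouts)
# Drop prompts that are always solved or never solved.
medium_only = filter_by_difficulty(per_prompt, min_acc=0.25, max_acc=0.75)
```

With the column already present in the dataset, the filtering step would reduce to a single range check per row, as in filter_by_difficulty.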

faresobeid, Sep 22 '25 22:09