lighteval
lighteval copied to clipboard

Published 20 hours ago •

Reame
Issues

[FT] Adding caching for each dataset run

Open JoelNiklaus opened this issue 10 months ago • 2 comments

Issue encountered

When running large evals with many dataset configurations it is very painful to rerun everything in case something fails.

Solution/Feature

It would be great if intermediate results could be cached, for example the computed metrics of each dataset.

Dec 02 '24 22:12 JoelNiklaus