edoardocontente
Results
1
issues of
edoardocontente
It would be great to support logprobs to evaluate some of the benchmarks! Some of them natively "require" scoring via logprobs rather than via the generated answer, so it would...