edoardocontente

Results: 1 issue by edoardocontente

It would be great to support logprobs to evaluate some of the benchmarks! Some of them natively "require" scoring via logprobs rather than via the generated answer, so it would...
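To illustrate what scoring via logprobs means for a multiple-choice benchmark, here is a minimal sketch. It assumes the backend can already return per-token log-probabilities for each candidate answer given the prompt. The function name `score_choices`, the `normalize` flag, and the example numbers are all hypothetical and not taken from any particular library.

```python
from typing import Dict, List


def score_choices(token_logprobs: Dict[str, List[float]], normalize: bool = False) -> str:
    """Pick the candidate answer whose tokens have the highest total log-probability.

    token_logprobs maps each candidate answer to the log-probabilities of its
    tokens, conditioned on the prompt (assumed to be supplied by the model backend).
    With normalize=True the total is divided by the token count, so longer
    answers are not penalized just for being longer.
    """
    if not token_logprobs:
        raise ValueError("no choices to score")
    scores = {}
    for choice, logprobs in token_logprobs.items():
        total = sum(logprobs)
        scores[choice] = total / len(logprobs) if normalize and logprobs else total
    # The prediction is the highest-scoring candidate; no text is generated.
    return max(scores, key=scores.get)


# Hypothetical per-token log-probabilities for three candidate answers.
example = {
    "A": [-0.2, -0.1],
    "B": [-1.5, -0.7],
    "C": [-2.3],
}
best = score_choices(example)
```

The prediction is compared against the gold label directly, so the result does not depend on parsing free-form generated text.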