Dylan Bouchard

Results 1 issues of Dylan Bouchard

**Summary** Integrate UQLM’s response-level confidence scoring (bounded from 0 to 1) into Ragas via new metrics that calls UQLM `BlackBoxUQ` scorers and/or `WhiteBoxUQ` when token-level logprobs are available. These metrics...

enhancement
module-metrics