Dylan Bouchard
**Summary** Integrate UQLM’s response-level confidence scoring (bounded from 0 to 1) into Ragas via new metrics that call UQLM `BlackBoxUQ` scorers and/or `WhiteBoxUQ` scorers when token-level logprobs are available. These metrics...
enhancement
module-metrics
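
To make the idea of a response-level confidence score bounded from 0 to 1 concrete, here is a minimal sketch of a black-box, consistency-style score. It does not call UQLM or Ragas. The function names and the exact-match agreement rule are assumptions chosen for illustration, not the scorers or the metric interface proposed in the issue.

```python
from typing import Sequence


def _normalize(text: str) -> str:
    """Lowercase and collapse whitespace so trivial differences do not count."""
    return " ".join(text.lower().split())


def consistency_confidence(original: str, samples: Sequence[str]) -> float:
    """Hypothetical black-box confidence score in [0, 1].

    Returns the fraction of re-sampled responses that agree with the
    original response after normalization. Exact-match agreement is an
    assumption of this sketch; a real scorer would likely use a softer
    similarity measure.
    """
    if not samples:
        raise ValueError("at least one sampled response is required")
    target = _normalize(original)
    matches = sum(1 for sample in samples if _normalize(sample) == target)
    return matches / len(samples)


# Two of the four sampled responses agree with the original answer.
score = consistency_confidence("Paris", ["paris", "Paris ", "Lyon", "Marseille"])
```

A score near 1 means the sampled responses mostly agree with the original answer, and a score near 0 means they mostly disagree. A white-box variant would instead derive the score from token-level logprobs, which requires a model that exposes them.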