Sasha Harrison

Results 4 issues of Sasha Harrison

Fixes some problems with 3D evaluation metrics and adds unit test coverage on the numerical correctness of metric functions. Specific issues found: - enforcing label matches not properly taken into...

Counterpart to: https://github.com/scaleapi/scaleapi/pull/36062