evaluate
[Feature] Add G-Pass@k Metric
An implementation of the G-Pass@k metric described in https://arxiv.org/abs/2412.13147
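
For orientation, here is a minimal sketch of the per-problem estimator as I read it from the paper, not the code in this PR. It assumes `n` generations were sampled for one problem, `c` of them were correct, and that G-Pass@k at threshold `tau` is the hypergeometric probability that at least `ceil(tau * k)` of `k` draws without replacement are correct. The function name `g_pass_at_k` and its signature are hypothetical.

```python
from math import ceil, comb


def g_pass_at_k(n: int, c: int, k: int, tau: float) -> float:
    """Estimate G-Pass@k at threshold tau for a single problem.

    n: total generations sampled, c: number of correct generations,
    k: draw size, tau: required fraction of correct draws, in (0, 1].
    (Hypothetical helper; names are assumptions, not the PR's API.)
    """
    if not (0 <= c <= n and 1 <= k <= n and 0 < tau <= 1):
        raise ValueError("require 0 <= c <= n, 1 <= k <= n, 0 < tau <= 1")
    # Minimum number of correct draws; round first so that float noise
    # such as 0.7 * 10 == 7.000000000000001 does not push ceil up to 8.
    m = max(ceil(round(tau * k, 9)), 1)
    # Hypergeometric tail: P(at least m of the k draws are correct).
    favorable = sum(
        comb(c, j) * comb(n - c, k - j) for j in range(m, min(c, k) + 1)
    )
    return favorable / comb(n, k)


if __name__ == "__main__":
    # 16 generations, 12 correct, draw 8, require all 8 to be correct.
    print(g_pass_at_k(n=16, c=12, k=8, tau=1.0))
```

Under this reading, a threshold small enough that one correct draw suffices gives the usual Pass@k, and `tau=1.0` requires every draw to be correct. A dataset-level score would be the mean of this value over all problems.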