sbite0138

Results 1 issues of sbite0138

The script provides a `--max_examples` option to limit the number of evaluation samples. However, `total `is incremented and compared to `max_examples` _before_ the per‑example score is recorded. If `--max_examples` is...