aider
Benchmark features: pause/resume, language breakdown, total API calls
This PR adds several enhancements to the benchmarking functionality:
- Added the ability to pause and resume a benchmark run
- Updated the print formatting of the language output
- Added benchmark metrics: total API calls, retries, and language-specific pass rates
These changes improve the benchmarking experience and provide more detailed metrics for analysis.
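
For reviewers who want a concrete picture of the new metrics, here is a minimal sketch of how per-language pass rates and call totals could be aggregated. It is not the code in this PR: the `summarize` function and the record fields (`language`, `passed`, `api_calls`, `retries`) are hypothetical names chosen for illustration.

```python
from collections import defaultdict


def summarize(results):
    """Aggregate hypothetical per-test records into benchmark metrics.

    Each record is assumed to be a dict with the keys
    "language", "passed", "api_calls", and "retries".
    """
    per_lang = defaultdict(lambda: {"passed": 0, "total": 0})
    api_calls = 0
    retries = 0

    for record in results:
        stats = per_lang[record["language"]]
        stats["total"] += 1
        if record["passed"]:
            stats["passed"] += 1
        api_calls += record["api_calls"]
        retries += record["retries"]

    # Pass rate per language as a fraction between 0 and 1.
    pass_rates = {
        lang: stats["passed"] / stats["total"]
        for lang, stats in per_lang.items()
    }
    return {"api_calls": api_calls, "retries": retries, "pass_rates": pass_rates}
```

Keeping the totals and the per-language breakdown in one pass means the summary can be recomputed cheaply from saved records, which would also suit a run that was paused and later resumed.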