eval-dev-quality
eval-dev-quality copied to clipboard
Evaluation task: TDD
Goal
Given an implementation and a test suite, and an additional failing test. Let the model modify the implementation such that all tests pass.