kjain14
Adding MuttFuzz (a binary fuzzing alternative to our "First, Fuzz the Mutants" paper). This will likely integrate better with fuzzbench and thus work on experiments, as we are mutating binaries...
**End-user friendly description of the problem this fixes or functionality that this introduces**

Adds a new unit test generation benchmark TestGenEval: https://arxiv.org/abs/2410.00752

---

**Give a summary of what the PR...
### Describe the bug

Running on the latest SWE-bench, I am unable to run the gold patch on django__django-10097.

### Steps/Code to Reproduce

```
❯ python -m swebench.harness.run_evaluation \
    --predictions_path gold...
```
### Describe the bug

Running gold patch on `psf__requests-2317`:

```
requests/packages/urllib3/connectionpool.py:558: in urlopen
    retries = retries.increment(method, url, error=e,
requests/packages/urllib3/util/retry.py:245: in increment
    raise six.reraise(type(error), error, _stacktrace)
requests/packages/urllib3/packages/six.py:309: in reraise
    raise value.with_traceback(tb)...
```