ML-Bench icon indicating copy to clipboard operation
ML-Bench copied to clipboard

The Official Repo of ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level Code (https://arxiv.org/abs/2311.09835)

Results 2 ML-Bench issues
Sort by recently updated
recently updated
newest added

I found that two parameters in script/run.sh were running incorrectly, where the type="quarter" parameter was not defined or used in query_gpt.py. The instructions="extend_instructions" parameter also returns KeyError: 'extend_instructions' How do...