ML-Bench
ML-Bench copied to clipboard
The Official Repo of ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level Code (https://arxiv.org/abs/2311.09835)
Results
2
ML-Bench issues
Sort by
recently updated
recently updated
newest added
I found that two parameters in script/run.sh were running incorrectly, where the type="quarter" parameter was not defined or used in query_gpt.py. The instructions="extend_instructions" parameter also returns KeyError: 'extend_instructions' How do...