WizardLM
Running `evaluate_functional_correctness ${output_path}.jsonl` gives pass@k results that are all 1.
I think you first need to check whether the code was generated correctly, and then check whether the humaneval environment is installed correctly. Since your screenshot provides only limited information, it is hard for me to suggest a specific fix.
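For the first check, a quick look at the samples file can help. Below is a minimal sketch, assuming the file follows the usual HumanEval samples layout (one JSON object per line with `task_id` and `completion` fields). The helper name `summarize_samples` is hypothetical and not part of any library.

```python
import json
from collections import Counter


def summarize_samples(path):
    """Summarize a HumanEval-style samples file (one JSON object per line).

    Assumes each record has "task_id" and "completion" keys.
    """
    total = 0
    empty = 0
    per_task = Counter()
    with open(path, encoding="utf-8") as f:
        for line in f:
            line = line.strip()
            if not line:
                continue  # skip blank lines
            record = json.loads(line)
            total += 1
            per_task[record["task_id"]] += 1
            # A completion that is missing or only whitespace is suspicious.
            if not record.get("completion", "").strip():
                empty += 1
    return {"total": total, "empty_completions": empty, "tasks": len(per_task)}


# Example usage (replace the path with your own samples file):
# print(summarize_samples("samples.jsonl"))
```

If `empty_completions` is high, or `tasks` is much smaller than the number of problems you expect, the generation step is probably the place to look before the evaluation environment.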