albertimff
Results: 4 comments of albertimff
command line: evaluate_functional_correctness samples.jsonl
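For context, a minimal sketch of how a `samples.jsonl` file for that command might be produced, assuming the comment refers to the HumanEval evaluation harness, which reads one JSON object per line with `task_id` and `completion` fields. The helper name `write_samples` and the placeholder completion are hypothetical, for illustration only.

```python
import json


def write_samples(samples, path):
    """Write one JSON object per line (JSONL format)."""
    with open(path, "w", encoding="utf-8") as f:
        for sample in samples:
            f.write(json.dumps(sample) + "\n")


# Hypothetical placeholder completion; a real run would use model output.
samples = [
    {"task_id": "HumanEval/0", "completion": "    return False\n"},
]

write_samples(samples, "samples.jsonl")
```

The resulting file can then be passed to the command above.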
I think you could design your own reward manager.
I have the same question. How can it be solved?
So what is the reason?