ClassEval
ClassEval copied to clipboard
Do you plan to update the benchmark?
This is a good benchmark, thank you for that. Do you plan to add modern models like Opus, llama-3, granite, codeqwen1.5-chat and so on to the benchmark?
Thanks for your feedback! We are currently working on this and plan to add the latest models to the benchmark soon.