humaneval topic

List humaneval repositories

SkyCode-AI-CodeX-GPT3

Stars: 393 | Forks: 21

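SkyCode is a multilingual open-source large language model for programming. It uses the GPT-3 model architecture, supports many mainstream programming languages including Java, JavaScript, C, C++, Python, Go, and shell, and can understand Chinese comments. The model can complete code and has strong problem-solving ability, freeing you from programming...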

can-ai-code

Stars: 518 | Forks: 29

Self-evaluating interview for AI coders

code-eval

Stars: 376 | Forks: 36

Run evaluations on LLMs using the human-eval benchmark

COBOLEval

Stars: 26 | Forks: 2

Evaluate LLM-generated COBOL

AutoCoder

Stars: 787 | Forks: 68

We introduce a new model designed for the code generation task. Its test accuracy on the HumanEval base dataset surpasses that of GPT-4 Turbo (April 2024) and GPT-4o.