codefuse-evaluation icon indicating copy to clipboard operation
codefuse-evaluation copied to clipboard

Industrial-level evaluation benchmarks for Coding LLMs in the full life-cycle of AI native software developing.企业级代码大模型评测体系,持续开放中

Results 5 codefuse-evaluation issues
Sort by recently updated
recently updated
newest added

Hi, CodeFuse-AI team, I am interested in evaluating several code assistant products. However, I do not possess a large-scale code model of my own. Instead, what I have are the...

This work is very exciting!

Add natural language to code evaluation language dataset for Java and JavaScript.

# Add Comprehensive Python Testing Infrastructure ## Summary This PR establishes a complete testing infrastructure for the CodeFuse evaluation framework, transitioning from basic requirements.txt dependency management to a modern Poetry-based...