awesome-ai4code icon indicating copy to clipboard operation
awesome-ai4code copied to clipboard

A collection of recent papers, benchmarks and datasets of AI4Code domain.

A Collection of Tools, Papers and Recent Trendings in AI for Software (AI4SE, AI4Code).

A niche collection of AI4Code papers and other resources (dataset, tutorial, etc.). There are also other collections that cover a wider range of content about AI4Code, such as:

  • https://ml4code.github.io/
  • https://github.com/src-d/awesome-machine-learning-on-source-code (Due to the lack of maintainers, this collection may be obsolete).

Tools/Products

A list of of AI coding tools (assistants, completion, refactoring, etc.).

AI code completion tools

More General Coding Assistants

ChatGPT in your editor

LLM-powered natural language compilers

Academic

Conferences

Software Enginnering/Programming Languages

The emphasis is on combining program analysis and deep learning to solve novel software engineering/programming languages task. In most cases, strong empirical results are required. Typically, new datasets are usually curated.

Machine Learning/AI

The emphasis is on desigining novel neural network architectures to process code. Typically, new datasets are usually curated.

Natural Language Processing

The emphasis is on applying NLP techniques for code, and the evaluation is primarily on running the models on known benchmark datasets; unique tasks are rarely introduced.

Papers (This list is a bit outdated, need to update)

Evaluate CodeLLMs

Techniques for Code Generation

  • [SKCODER: A Sketch-based Approach for Automatic Code Generation)(https://arxiv.org/pdf/2302.06144.pdf) - Jia Allen Li, Yongmin Li, Ge Li, Zhi Jin, Yiyang Hao, Xing Hu

Repo-Level CodeLLMs

Benchmarking CodeLLMs

Pretrained CodeLLMs

Dataset and Benchmark

Talks and Tutorials

Articles