CodeRL

This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (NeurIPS 2022).

Results 46 CodeRL issues

Bumps [transformers](https://github.com/huggingface/transformers) from 3.5.1 to 4.30.0. Release notes Sourced from transformers's releases. v4.30.0: 100k, Agents improvements, Safetensors core dependency, Swiftformer, Autoformer, MobileViTv2, timm-as-a-backbone 100k Transformers has just reached 100k stars...

dependencies

Bumps [requests](https://github.com/psf/requests) from 2.22.0 to 2.31.0. Release notes Sourced from requests's releases. v2.31.0 2.31.0 (2023-05-22) Security Versions of Requests between v2.3.0 and v2.30.0 are vulnerable to potential forwarding of Proxy-Authorization...

dependencies

Bumps [requests](https://github.com/psf/requests) from 2.22.0 to 2.31.0. Release notes Sourced from requests's releases. v2.31.0 2.31.0 (2023-05-22) Security Versions of Requests between v2.3.0 and v2.30.0 are vulnerable to potential forwarding of Proxy-Authorization...

dependencies

@henryhungle This is really nice work that bridges the gap between SE and AI! However, it still seems incomplete. In the README, you said that "We will release...

Hi authors, could you please add some documentation for the `test_one_solution.py` file? I have 2 questions:
- What does `test_one_solution.py` do? (What are the input and output?)
- How to...

Hello, I noticed that you have trained a classification model (the Critic). How did you overcome the problem of class imbalance in the data? As far as I know, accepted solutions should...
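For context on the question above: how the authors handled imbalance is not stated in this excerpt. One common general-purpose heuristic, shown here only as a sketch, is to weight each class inversely to its frequency so that rare outcomes count more in the loss. The function name `balanced_class_weights` and the label names are hypothetical, not taken from the CodeRL code.

```python
from collections import Counter


def balanced_class_weights(labels):
    """Return a weight per class, inversely proportional to its frequency.

    Uses the heuristic n_samples / (n_classes * class_count), so a perfectly
    balanced dataset yields a weight of 1.0 for every class.
    """
    counts = Counter(labels)
    n_samples = len(labels)
    n_classes = len(counts)
    return {cls: n_samples / (n_classes * count) for cls, count in counts.items()}


# Hypothetical label distribution: many failing programs, few passing ones.
labels = ["failed"] * 8 + ["passed"] * 2
weights = balanced_class_weights(labels)
print(weights)
```

The resulting weights could then be passed to a weighted classification loss; whether that suits the Critic model is a question for the authors.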

Hi, I wonder what the exact input formats are for APPS/MBPP problems fed into [CodeT5-large-ntp-py](https://huggingface.co/Salesforce/codet5-large-ntp-py) or [CodeT5-finetuned_CodeRL](https://console.cloud.google.com/storage/browser/sfr-coderl-research/codet5_finetuned_codeRL)? I tried """{Problem}""" but it doesn't work well, generating a lot of...

I am reading your CodeRL paper. It uses the APPS benchmark to show the performance comparison with Codex. Do you have any comparison results using the HumanEval dataset?

The `start_idx` variable in the line `problem_idx += start_idx` is not defined in `extract_example_test.ipynb`.
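The report above describes a name being read before it is assigned. A minimal sketch of one possible workaround follows, assuming `start_idx` is meant to be an integer offset added to each problem index; the notebook's intended value is not given here, so the default of `0` and the helper name `offset_problem_indices` are hypothetical.

```python
def offset_problem_indices(problem_indices, start_idx=0):
    """Shift each local problem index by an explicit starting offset.

    Passing start_idx as a parameter with a default avoids relying on a
    variable that may never have been defined in an earlier notebook cell.
    """
    return [problem_idx + start_idx for problem_idx in problem_indices]


# Assumed usage: local indices 0..2 mapped into a later slice of the dataset.
shifted = offset_problem_indices([0, 1, 2], start_idx=4000)
print(shifted)
```

Making the offset an explicit argument also keeps the cell runnable on its own, regardless of which earlier cells were executed.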