CodeGen icon indicating copy to clipboard operation
CodeGen copied to clipboard

Reference implementation of code generation projects from Facebook AI Research. General toolkit to apply machine learning to code, from dataset creation to model training and evaluation. Comes with pr...

Results 38 CodeGen issues
Sort by recently updated
recently updated
newest added

Hi, I want to train model for translation pascal2java. I have small datasets about 2 Gb of pascal and 2 Gb of java. I train model mlm and then train...

I am running the CodeGen using the test repository (https://github.com/facebookresearch/CodeGen/tree/main/data/test_dataset) for obfuscation mode `run codegen_sources/preprocessing/preprocess.py data/python_test --mode obfuscation --local True --local_parallelism 4 --langs python --train_splits 1 --tokenization_timeout 400 --bpe_timeout 220...

I am trying to run the preprocessing.py file and getting this unknow error. Can you tell me how to resolve this. ```` run codegen_sources/preprocessing/preprocess.py data/test_dataset --mode obfuscation --langs python --mode...

When I run the CodeGen, the preprocessing.py it is asking for various arguments and when I pass those argument its showing unrecognized arguments. Below are my codes. %run codegen_sources/preprocessing/preprocess data/test_dataset...

question

Hi, I am trying to follow the instruction in dobf.md to evaluate Transcoder_model_1.pth on Clone detection. After I run following command, the error related to reloading the model appears. I...

bug

Hi, When I was following the instructions from https://github.com/facebookresearch/CodeGen/blob/main/docs/TransCoder-ST.md to create self-training dataset. The create_self_training_dataset.sh failed with the Assertion Error: ######### Creating Tests ########## Traceback (most recent call last): File...

bug

Hi, appreciate the amazing work in unsupervised code translation! I wonder if you have done ablation study on the training data size of TransCoder? Since the unsupervised model needs way...

question

I am working on a project where I have to convert Teradata SQL queries to redshift queries. This has to be then generalized for other dialects to like for eg...

question

I try to run CodeGen on Windows. But error occurred when I install fastBPE with pip3.I donot know how to solve.Could anyone help me? The error log list here: Collecting...

enhancement

Hi, as the paper "Unsupervised Translation of programming languages" mentioned, there are 852 parallel functions. So I checked the data in this repo [fold](https://github.com/facebookresearch/CodeGen/tree/main/data/transcoder_evaluation_gfg) and found (each file contain one...

question