Baptiste Roziere comments

Results 21 comments of


                                            Baptiste Roziere

Ablation on data size

Hi, Thank you. We have not really done an ablation study on the dataset size. However, the numbers you are quoting are for non deduplicated functions. We get about the...

Question for validation and test sets

Hi and sorry for the late answer, We created the tests by generating test cases for each problem, then running the generated tests on the ground truth. We considered the...

UncompletedJobError: No output/error stream produced

Hi, It may be because all 35 examples in the python file you kept are sent to the training set. Maybe train running it on the 3 python files in...

Could CodeGen run on Windows?

Hi. As your issue is with fastBPE, you may want to ask them if they support windows on [their repo](https://github.com/glample/fastBPE). We did not test our code on windows and you...

Panda error while creating Online test data

Hi, So it's failing when trying to consolidate the test summaries created by evosuite. You could rerun the test creation command with --rerun=True (or delete the `tests` directory in your...

Can this be used to translate between SQL dialects?

Hi. Given sufficient data, I believe that you could use the same objectives as TransCoder to translate between SQL dialects.

Evaluate Transcoder_model_1 on CodeXGlue benchmark

Hi, TransCoder is not really made for pre-training a model for things like clone detection. What you are trying to do would reload only the encoder of TransCoder and fine-tune...

Evaluate Transcoder_model_1 on CodeXGlue benchmark

Hi, We definitely managed to test models with the same encoder parameters as TransCoder on CodeXGlue before. I did not test it recently, and I guess there will still be...

Code translation inference optimzation

It's done on GPUs but we didn't really optimize the translate.py script. For instance it can only take one example at a time. If you want to translate several functions,...

Outputted files have erroneous characters [bug on mac os]

Hi, I guess you are referring to the @@ characters ? These characters were created by fastBPE when breaking the tokens into subtokens. To undo the BPE, you can just...