CodeSearchNet icon indicating copy to clipboard operation
CodeSearchNet copied to clipboard

Functions with original comments

Open timvandam opened this issue 2 years ago • 0 comments

I am looking to use CodeSearchNet to train a code completion model that takes the current code as input and predicts the next token. However, the CodeSearchNet data does not appear to contain the raw comments, making it impossible to re-construct the original code (i.e. it is impossible to tell whether the original comment was a single-line comment or multi-line comment for Java, JavaScript, etc)

Is this data available somewhere, or is my best bet to simply put the plain-text comments inside a multi-line comment block for every sample?

timvandam avatar Jan 25 '23 12:01 timvandam