
Common samples within training and test set

Open Keramatfar opened this issue 1 year ago • 0 comments

Hi, as described in the paper, for each vulnerable token you generate a number of samples (perhaps 200) with a moving window. My question is: do you prevent samples derived from the same token from appearing in both the training and test sets? Samples from the same token overlap heavily, so if they land in both sets there are many common tokens between training and test, which results in optimistically biased performance.
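To make the question concrete, here is a minimal sketch of the kind of split I have in mind. It assumes each sample can be tagged with an ID for the vulnerable token it was generated from; `group_split`, `samples`, and `group_ids` are hypothetical names for illustration and are not taken from the repository or the paper.

```python
import random
from collections import defaultdict


def group_split(samples, group_ids, test_fraction=0.2, seed=0):
    """Split samples so that all windows from one source token stay on one side.

    Hypothetical helper: `group_ids[i]` identifies the vulnerable token
    that `samples[i]` was generated from.
    """
    if len(samples) != len(group_ids):
        raise ValueError("samples and group_ids must have the same length")

    # Collect the samples belonging to each source token.
    by_group = defaultdict(list)
    for sample, gid in zip(samples, group_ids):
        by_group[gid].append(sample)

    # Shuffle the groups (not the individual samples) reproducibly.
    groups = sorted(by_group)
    random.Random(seed).shuffle(groups)

    # Hold out whole groups for the test set.
    n_test = max(1, round(len(groups) * test_fraction))
    test_groups = set(groups[:n_test])

    train = [s for g in groups[n_test:] for s in by_group[g]]
    test = [s for g in groups[:n_test] for s in by_group[g]]
    return train, test, test_groups


# Made-up data: 5 vulnerable tokens, 4 moving-window samples each.
samples = [(tok, w) for tok in range(5) for w in range(4)]
group_ids = [tok for tok, _ in samples]

train, test, test_groups = group_split(samples, group_ids, test_fraction=0.2)
train_tokens = {tok for tok, _ in train}
test_tokens = {tok for tok, _ in test}
print(train_tokens.isdisjoint(test_tokens))
```

With a split like this, no source token contributes windows to both sets. If scikit-learn is available, `GroupShuffleSplit` performs the same kind of group-aware split.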

Keramatfar, Dec 06 '22 10:12