VulnerabilityDetection
Common samples within training and test set
Hi, As described in the paper, for each vulnerable token you generate several samples (perhaps 200) using a moving window. My question is: do you prevent samples derived from the same token from appearing in both the training and test sets? Because those samples have many tokens in common, having them in both sets would result in optimistic performance.
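To make the concern concrete, here is a minimal sketch of the kind of split I have in mind, assuming each sample is stored together with the id of the vulnerable token it was generated from. The function name `split_by_group`, the `test_fraction` parameter, and the toy data are all hypothetical and are not taken from the repository or the paper.

```python
import random
from collections import defaultdict


def split_by_group(samples, test_fraction=0.2, seed=0):
    """Split (group_id, sample) pairs so that no group_id lands in both sets.

    group_id is assumed to identify the vulnerable token a window came from.
    """
    # Collect all windows that belong to the same vulnerable token.
    groups = defaultdict(list)
    for group_id, sample in samples:
        groups[group_id].append(sample)

    # Shuffle the group ids (not the individual samples) reproducibly.
    group_ids = sorted(groups)
    random.Random(seed).shuffle(group_ids)

    # Assign whole groups to the test set; at least one group goes there.
    n_test = max(1, round(len(group_ids) * test_fraction))
    test_ids = group_ids[:n_test]
    train_ids = group_ids[n_test:]

    train = [(g, s) for g in train_ids for s in groups[g]]
    test = [(g, s) for g in test_ids for s in groups[g]]
    return train, test


# Toy data: 10 hypothetical vulnerable tokens, 5 overlapping windows each.
samples = [(token_id, f"window_{token_id}_{k}")
           for token_id in range(10) for k in range(5)]

train, test = split_by_group(samples, test_fraction=0.2, seed=0)
train_tokens = {g for g, _ in train}
test_tokens = {g for g, _ in test}
print(len(train), len(test), train_tokens & test_tokens)
```

With a split like this, every window generated from a given vulnerable token ends up on one side only, so the test score is not inflated by near-duplicate windows seen during training.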