near-duplicate-code-detector

Non-deterministic output

Open • skysb opened this issue 4 years ago • 1 comment

When I run the solution repeatedly on the same tokens JSONL file, the output JSON of clone groups differs from one execution to the next. Is there a more deterministic approach to near-duplicate detection?

Cheers, Sky

skysb, Sep 17 '20 09:09

Apologies for only just seeing this issue. If parallelism is disabled, you should get deterministic output. Alternatively, consider using this code, which should also yield deterministic results.

mallamanis, Dec 17 '20 16:12
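
To illustrate why a single-threaded run can be made reproducible, here is a minimal Python sketch of order-independent clone grouping. It is not the detector's actual implementation: the `jaccard` and `clone_groups` helpers, the `threshold` value, and the input shape (a mapping from file name to its set of tokens) are all assumptions made for the example. The idea is to visit files in sorted order, merge groups with a fixed tie-break, and sort the result, so the output does not depend on input order or scheduling.

```python
from itertools import combinations


def jaccard(a: set, b: set) -> float:
    """Jaccard similarity of two token sets (1.0 when both are empty)."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def clone_groups(token_sets: dict, threshold: float = 0.8) -> list:
    """Group files whose token sets are similar, in a reproducible order.

    `token_sets` maps a file name to its set of tokens (hypothetical input
    shape, not the tool's JSONL schema). `threshold` is an assumed cut-off.
    """
    names = sorted(token_sets)           # fixed visiting order
    parent = {name: name for name in names}

    def find(x: str) -> str:
        # Union-find lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    # Sequential pairwise comparison: no parallelism, so no scheduling effects.
    for a, b in combinations(names, 2):
        if jaccard(token_sets[a], token_sets[b]) >= threshold:
            root_a, root_b = find(a), find(b)
            if root_a != root_b:
                # Always keep the lexicographically smaller root.
                low, high = sorted((root_a, root_b))
                parent[high] = low

    groups = {}
    for name in names:
        groups.setdefault(find(name), []).append(name)

    # Keep only real clone groups and sort them for a canonical output.
    return sorted(group for group in groups.values() if len(group) > 1)


if __name__ == "__main__":
    files = {
        "b.py": {"x", "y", "z", "w", "v"},
        "a.py": {"x", "y", "z", "w", "v"},
        "c.py": {"p", "q"},
    }
    print(clone_groups(files))
```

Because the pairs are compared one at a time in sorted order, running this twice on the same input gives the same groups in the same order. The quadratic pairwise loop is only for illustration and would be too slow for large corpora.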