near-duplicate-code-detector
near-duplicate-code-detector copied to clipboard
Non deterministic output
Running the solution with the same tokens jsonl file, the output json of clone groups varies upon each execution. Is there a more deterministic approach to near duplicate detection?
Cheers Sky
Apologies for just seeing this issue... If the parallelism is disabled, then you should be able to get a deterministic output. Alternatively, consider using this code which should also yield deterministic results.