pyrepseq icon indicating copy to clipboard operation
pyrepseq copied to clipboard

symdel implementation returns edges twice

Open yutanagano opened this issue 1 year ago • 0 comments

If strings at index x and y are sufficiently similar to one another at distance d, the current symdel implementation returns both (x, y, d) and (y, x, d). Unless we plan to support non-symmetric divergences in place of proper metrics like levenshtein, this is redundant behaviour that can be harmful at large numbers of query strings. Is this behaviour intentional?

yutanagano avatar Nov 08 '24 22:11 yutanagano