
Implement Series similarity functions

Open mzaks opened this issue 1 year ago • 10 comments

Problem description

Given two Series (columns or arrays), I would like to be able to compute a similarity score between them using the Jaccard index.
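For reference, the Jaccard index of two sets A and B is |A ∩ B| / |A ∪ B|. Below is a minimal pure-Python sketch of the requested behaviour, assuming each Series is treated as the set of its distinct values. The name `jaccard_index` and the convention for two empty inputs are illustrative assumptions, not an existing Polars API.

```python
from typing import Hashable, Iterable


def jaccard_index(a: Iterable[Hashable], b: Iterable[Hashable]) -> float:
    """Jaccard index |A ∩ B| / |A ∪ B| over the distinct values of a and b.

    Hypothetical helper for illustration; not part of Polars.
    """
    set_a, set_b = set(a), set(b)
    union = set_a | set_b
    if not union:
        # Assumption: two empty inputs are treated as identical.
        return 1.0
    return len(set_a & set_b) / len(union)


if __name__ == "__main__":
    # {2, 3} shared out of {1, 2, 3, 4} distinct values overall.
    print(jaccard_index([1, 2, 3], [2, 3, 4]))
```

With a Polars Series, the values could presumably be passed in after converting them to a Python list, though a native implementation would avoid that round trip.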

Furthermore, other hash-based similarity techniques such as MinHash, SimHash, and LSH could be considered for cases where accuracy can be sacrificed in favour of performance.
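To illustrate the accuracy/performance trade-off, here is a rough MinHash sketch in plain Python. It approximates the Jaccard index by comparing fixed-size signatures instead of the full sets. The function names, the use of seeded `blake2b` as the hash family, and the default of 128 hashes are all assumptions made for this example, not a proposed Polars design.

```python
import hashlib
from typing import Hashable, Iterable, List


def _seeded_hash(seed: int, value: Hashable) -> int:
    # Assumption: a seeded 64-bit blake2b digest stands in for a family
    # of independent hash functions.
    data = f"{seed}:{value!r}".encode("utf-8")
    return int.from_bytes(hashlib.blake2b(data, digest_size=8).digest(), "big")


def minhash_signature(values: Iterable[Hashable], num_hashes: int = 128) -> List[int]:
    """For each seeded hash function, keep the minimum hash over the distinct values."""
    items = set(values)
    if not items:
        raise ValueError("cannot build a MinHash signature for an empty input")
    return [min(_seeded_hash(seed, v) for v in items) for seed in range(num_hashes)]


def estimate_jaccard(sig_a: List[int], sig_b: List[int]) -> float:
    """Estimate the Jaccard index as the fraction of matching signature slots."""
    if len(sig_a) != len(sig_b):
        raise ValueError("signatures must have the same length")
    matches = sum(x == y for x, y in zip(sig_a, sig_b))
    return matches / len(sig_a)


if __name__ == "__main__":
    sig_a = minhash_signature(range(0, 100))
    sig_b = minhash_signature(range(50, 150))
    # The exact Jaccard index here is 50 / 150; the estimate should be close.
    print(estimate_jaccard(sig_a, sig_b))
```

More hash functions give a tighter estimate at the cost of larger signatures. LSH would typically build on such signatures by bucketing bands of them to find candidate pairs without comparing everything.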

mzaks, Dec 31 '22 12:12