polars
polars copied to clipboard
Implement Series similarity functions
Problem description
Given two Series (column or array) I would like to be able to compute similarity score with Jaccard index.
Further more, other hash based similarity functions like MinHash, SimHash and LSH could be considered when accuracy can be sacrificed in favour of performance.