datasketch
datasketch copied to clipboard
Speed up MinHash and LSH using One-Permutation Hashing
One-Permutation hashing seems to speed up MinHash creation without loosing much accuracy.
We can try this out. However this really depends on the accuracy-speed trade off. Also I would put this as lower priority comparing to #109 due to memory being more important for big data analytics.