LSH
LSH copied to clipboard
How to make minhash scalable
If suppose I have 100,000 sentences or document. and I want to find the pairwise jaccard similarity. How to make minhash algorithm scalable? could please add the example for the same.