datasketch
Is the return of MinHashLSH.query() in order?
Is the return of MinHashLSH.query() in ascending/descending order by Jaccard similarity?
Same question here. I'm also wondering whether it is possible to get the estimated similarity when using MinHashLSH.query(), instead of just knowing which keys are above the threshold.
Is the return of MinHashLSH.query() in ascending/descending order by Jaccard similarity?
It is currently not sorted. You can sort the results yourself by computing the estimated Jaccard similarity with MinHash. MinHashLSH should be the first step of the retrieval process: it locates promising candidates, reducing the computation you need to spend on filtering and ranking them.