datasketch

Is the return value of MinHashLSH.query() ordered?

Open charlotte-ling opened this issue 2 years ago • 2 comments

Is the list returned by MinHashLSH.query() sorted in ascending or descending order of Jaccard similarity?

charlotte-ling, Mar 22 '22 12:03

Same question here. I'm also wondering whether it is possible to get the estimated similarity from MinHashLSH.query(), instead of only learning which keys are above the threshold.

bdeng3, Apr 07 '22 02:04

> Is the list returned by MinHashLSH.query() sorted in ascending or descending order of Jaccard similarity?

Currently it is not. You can sort the result yourself by computing the estimated Jaccard similarity with MinHash for each returned key. MinHashLSH should be the first step of the retrieval process: it locates promising candidates, which reduces the computation you need to spend on filtering and ranking them.

ekzhu avatar Jun 02 '22 19:06 ekzhu
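
For anyone landing here, the suggestion above can be sketched roughly as follows. This is only a minimal illustration, not part of datasketch: `rank_candidates`, `demo`, and the `minhashes` dict (a mapping from each indexed key to the MinHash inserted under it) are hypothetical names, and the sketch assumes you keep those MinHash objects around yourself, since `query()` returns keys only. It relies on `MinHashLSH.query()` and `MinHash.jaccard()` and nothing else.

```python
def rank_candidates(lsh, query_minhash, minhashes):
    """Return (key, estimated_jaccard) pairs for the LSH candidates, best first.

    `minhashes` is a hypothetical dict: indexed key -> the MinHash that was
    inserted under that key. It is an assumption of this sketch, not
    something MinHashLSH provides.
    """
    candidates = lsh.query(query_minhash)  # candidate keys, in no particular order
    scored = [(key, query_minhash.jaccard(minhashes[key])) for key in candidates]
    # Highest estimated Jaccard similarity first.
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored


def demo():
    # Imported here so rank_candidates itself has no hard dependency.
    from datasketch import MinHash, MinHashLSH

    def make_minhash(tokens, num_perm=128):
        m = MinHash(num_perm=num_perm)
        for token in tokens:
            m.update(token.encode("utf8"))
        return m

    # Toy documents, invented for illustration only.
    docs = {
        "doc1": "minhash lsh finds candidate sets quickly".split(),
        "doc2": "minhash lsh finds similar sets".split(),
        "doc3": "a completely unrelated sentence about cooking".split(),
    }
    minhashes = {key: make_minhash(tokens) for key, tokens in docs.items()}

    lsh = MinHashLSH(threshold=0.3, num_perm=128)
    for key, m in minhashes.items():
        lsh.insert(key, m)

    query = make_minhash("minhash lsh finds candidate sets".split())
    for key, score in rank_candidates(lsh, query, minhashes):
        print(key, round(score, 3))
```

Calling `demo()` prints the candidate keys with their estimated similarities, highest first. The scores are MinHash estimates rather than exact Jaccard values, and the LSH step can still return false positives or miss true matches, so filter on the score if a strict threshold matters.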