datasketch icon indicating copy to clipboard operation
datasketch copied to clipboard

How to delete index from MinhashLSH forest?

Open charlotte-ling opened this issue 3 years ago • 1 comments

I wanna delete a index from MinhashLSH forest, but I didn't find "remove" function in forest like that in lsh

charlotte-ling avatar Apr 22 '22 06:04 charlotte-ling

I assume you meant deleting a key?

It is difficult to actually delete from LSH Forest, as it is implemented using sorted arrays. However it is possible to add a hash table for all keys indexed, and "fake" delete the key from there. I am just not sure how much of a performance overhead that would be as it requires checking the new hash table for every keys retrieved.

ekzhu avatar Jun 02 '22 19:06 ekzhu