superminhash icon indicating copy to clipboard operation
superminhash copied to clipboard

SuperMinHash: A New Minwise Hashing Algorithm for Jaccard Similarity Estimation, Simhash and SimhashIndex

SuperMinHash, Simhash and SimhashIndex

SuperMinHash

A New Minwise Hashing Algorithm for Jaccard Similarity Estimation

This is an implementation of Otmar Ertl's paper with the same title. The implementation is still in progress but almost there...

It is fork to Python from Go (source https://github.com/seiflotfy/superminhash)

Simhash and SimhashIndex

It is fork and redesign (source https://github.com/leonsim/simhash)