minhash topic
flajolet
Probabilistic data structures for OCaml
bloom-filters
JS implementation of probabilistic data structures: Bloom Filter (and its derived), HyperLogLog, Count-Min Sketch, Top-K and MinHash
datasketch
MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW
sketchy
Sketching Algorithms for Clojure (bloom filter, min-hash, hyper-loglog, count-min sketch)
sketch
C++ Implementations of sketch data structures with SIMD Parallelism, including Python bindings
LSH
Locality Sensitive Hashing using MinHash in Python/Cython to detect near duplicate text documents
sourmash
Quickly search, compare, and analyze genomic and metagenomic data sets.
elasticsearch-minhash
Elasticsearch plugin for b-bit minhash algorism
consimilo
A Clojure library for querying large data-sets on similarity
groot
A resistome profiler for Graphing Resistance Out Of meTagenomes