minhash topic

List minhash repositories

similarity-search-java

20
Stars
10
Forks
Watchers

Easy-to-use Java similarity algorithms for text and numeric-series

minhash

34
Stars
10
Forks
Watchers

This provides tools for b-bit MinHash algorism.

text-shingles

19
Stars
11
Forks
Watchers

k-shingling for text to help compare similarity

treeminhash

15
Stars
4
Forks
Watchers

TreeMinHash: Fast Sketching for Weighted Jaccard Similarity Estimation

rensa

57
Stars
3
Forks
Watchers

High-performance MinHash implementation in Rust with Python bindings for efficient similarity estimation and deduplication of large datasets