similarity-metric topic

List similarity-metric repositories

stringmetric

485
Stars
82
Forks
Watchers

:dart: String metrics and phonetic algorithms for Scala (e.g. Dice/Sorensen, Hamming, Jaccard, Jaro, Jaro-Winkler, Levenshtein, Metaphone, N-Gram, NYSIIS, Overlap, Ratcliff/Obershelp, Refined NYSIIS,...

L2C

313
Stars
49
Forks
Watchers

Learning to Cluster. A deep clustering strategy.

SGRAF

200
Stars
37
Forks
Watchers

[AAAI2021] The code of “Similarity Reasoning and Filtration for Image-Text Matching”

levenshtein

87
Stars
6
Forks
Watchers

Levenshtein distance and similarity metrics with customizable edit costs and Winkler-like bonus for common prefix.

rltk

103
Stars
23
Forks
Watchers

Record Linkage ToolKit (Find and link entities)

spark-fuzzy-matching

23
Stars
10
Forks
Watchers

Fuzzy matching function in spark (https://spark-packages.org/package/itspawanbhardwaj/spark-fuzzy-matching)

treeminhash

15
Stars
4
Forks
Watchers

TreeMinHash: Fast Sketching for Weighted Jaccard Similarity Estimation