string-distance topic

List string-distance repositories

triple_accel

95 stars, 11 forks

Rust edit distance routines accelerated using SIMD. Supports fast Hamming, Levenshtein, restricted Damerau-Levenshtein, etc. distance calculations and string search.

fuzzywuzzy

70 stars, 23 forks

Fuzzy string matching for PHP

mudderjs

112 stars, 9 forks

Lexicographically-subdivide the “space” between strings, by defining an alternate non-base-ten number system using a pre-defined dictionary of symbol↔︎number mappings. Handy for ordering NoSQL keys.

levenshtein

87 stars, 6 forks

Levenshtein distance and similarity metrics with customizable edit costs and Winkler-like bonus for common prefix.
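
To illustrate what "customizable edit costs" means, here is a minimal sketch of a weighted Levenshtein distance. It is not this repository's API: the function name `weighted_levenshtein` and its parameters are hypothetical, and the Winkler-like prefix bonus is left out.

```python
def weighted_levenshtein(a, b, insert_cost=1.0, delete_cost=1.0, substitute_cost=1.0):
    """Minimum total cost of edits that turn string a into string b.

    Hypothetical helper for illustration only; uses the standard
    row-by-row dynamic programming table.
    """
    # Row 0: turning the empty prefix of a into b[:j] takes j insertions.
    previous = [j * insert_cost for j in range(len(b) + 1)]
    for i, ca in enumerate(a, start=1):
        # Column 0: turning a[:i] into the empty string takes i deletions.
        current = [i * delete_cost]
        for j, cb in enumerate(b, start=1):
            current.append(min(
                previous[j] + delete_cost,          # drop ca
                current[j - 1] + insert_cost,       # add cb
                previous[j - 1] + (0.0 if ca == cb else substitute_cost),
            ))
        previous = current
    return previous[-1]


unit = weighted_levenshtein("kitten", "sitting")
costly_substitution = weighted_levenshtein("kitten", "sitting", substitute_cost=2.0)
```

With unit costs this reduces to the ordinary Levenshtein distance. Raising `substitute_cost` to 2.0 makes a substitution no cheaper than a deletion followed by an insertion.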

affinegap

58 stars, 9 forks

A Cython implementation of the affine gap string distance

stance

33 stars, 3 forks

Learned string similarity for entity names using optimal transport.

string-dist

16 stars, 3 forks

A Python library for calculating string distances using C extensions (with a pure Python fallback)

seqalign

25 stars, 2 forks

Collection of sequence alignment algorithms.

pyhacrf

24 stars, 12 forks

Hidden alignment conditional random field for classifying string pairs.

levenshtein_finder

21 stars, 2 forks

Similar string search in Levenshtein distance