string-similarity topic

List string-similarity repositories

levenshtein

Stars: 87 | Forks: 6

Levenshtein distance and similarity metrics with customizable edit costs and Winkler-like bonus for common prefix.

rltk

Stars: 103 | Forks: 23

Record Linkage ToolKit (Find and link entities)

levitate

Stars: 34 | Forks: 2

Fuzzy string matching in R. Inspired by Python's thefuzz (but without the Python).

learned-string-alignments

Stars: 38 | Forks: 6

Learning String Alignments for Entity Aliases

stance

Stars: 35 | Forks: 3

Learned string similarity for entity names using optimal transport.

Trie

Stars: 42 | Forks: 11

A Mixed Trie and Levenshtein distance implementation in Java for extremely fast prefix string searching and string similarity.

UMICollapse

Stars: 52 | Forks: 8

Accelerating the deduplication and collapsing process for reads with Unique Molecular Identifiers (UMI). Heavily optimized for scalability and orders of magnitude faster than a previous tool.

beda

Stars: 50 | Forks: 3

Beda is a Go library for detecting how similar two strings are.

string-similarity-js

Stars: 79 | Forks: 12

Lightweight string similarity function for JavaScript