string-similarity topic
levenshtein
Levenshtein distance and similarity metrics with customizable edit costs and Winkler-like bonus for common prefix.
rltk
Record Linkage ToolKit (Find and link entities)
levitate
Fuzzy string matching in R. Inspired by Python's thefuzz (but without the Python).
learned-string-alignments
Learning String Alignments for Entity Aliases
stance
Learned string similarity for entity names using optimal transport.
Trie
A Mixed Trie and Levenshtein distance implementation in Java for extremely fast prefix string searching and string similarity.
UMICollapse
Accelerating the deduplication and collapsing process for reads with Unique Molecular Identifiers (UMI). Heavily optimized for scalability and orders of magnitude faster than a previous tool.
beda
Beda is a golang library for detecting how similar a two string
string-similarity-js
Lightweight string similarity function for javascript