data-matching topic
data-matching-software
A list of free data matching and record linkage software.
recordlinkage
A powerful and modular toolkit for record linkage and duplicate detection in Python
recordlinkage-annotator
A browser user interface for manual labeling of record pairs.
splink
Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends
entity-embed
PyTorch library for transforming entities such as companies and products into vectors, to support scalable record linkage / entity resolution using approximate nearest neighbors.
fuzzymatcher
Record linkage package that fuzzy-matches two pandas DataFrames using SQLite's FTS4 full-text search extension
soweego
Link Wikidata items to large catalogs
record-linkage-resources
Resources for tackling record linkage / deduplication / data matching problems
levitate
Fuzzy string matching in R. Inspired by Python's thefuzz (but without the Python).
nominally
A maximum-strength name parser for record linkage.