record-linkage topic
data-matching-software
A list of free data matching and record linkage software.
FEBRL-fork-v0.4.2
Fork of the Freely Extensible Biomedical Record Linkage program
recordlinkage
A powerful and modular toolkit for record linkage and duplicate detection in Python
recordlinkage-annotator
A browser user interface for manual labeling of record pairs.
dedupe
:id: A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.
spark-lucenerdd
Spark RDD with Lucene's query and entity linkage capabilities
csvdedupe
:id: Command line tool for deduplicating CSV files
dedupe-examples
:id: Examples for using the dedupe library
libpostal
A C library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data.
talisman
Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.