record-linkage topic

List record-linkage repositories

data-matching-software

350
Stars
41
Forks
Watchers

A list of free data matching and record linkage software.

FEBRL-fork-v0.4.2

23
Stars
21
Forks
Watchers

Fork of the Freely Extensible Biomedical Record Linkage program

recordlinkage

915
Stars
150
Forks
Watchers

A powerful and modular toolkit for record linkage and duplicate detection in Python

recordlinkage-annotator

41
Stars
8
Forks
Watchers

A browser user interface for manual labeling of record pairs.

dedupe

4.0k
Stars
540
Forks
Watchers

:id: A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.

spark-lucenerdd

126
Stars
35
Forks
Watchers

Spark RDD with Lucene's query and entity linkage capabilities

csvdedupe

404
Stars
83
Forks
Watchers

:id: Command line tool for deduplicating CSV files

dedupe-examples

394
Stars
216
Forks
Watchers

:id: Examples for using the dedupe library

libpostal

4.0k
Stars
411
Forks
Watchers

A C library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data.

talisman

701
Stars
50
Forks
Watchers

Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.