record-linkage topic

splink

Stars: 1.1k, Forks: 127

Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends

entity-embed

Stars: 139, Forks: 13

PyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolution using Approximate Nearest Neighbors.

LIMES

Stars: 126, Forks: 54

Link Discovery Framework for Metric Spaces.

anonlink

Stars: 60, Forks: 7

Python implementation of anonymous linkage using cryptographic linkage keys

soweego

Stars: 95, Forks: 8

Link Wikidata items to large catalogs

rltk

Stars: 103, Forks: 23

Record Linkage ToolKit (Find and link entities)

record-linkage-resources

Stars: 105, Forks: 15

Resources for tackling record linkage / deduplication / data matching problems

blocklib

Stars: 19, Forks: 3

Python implementations of record linkage blocking techniques.