recordlinkage icon indicating copy to clipboard operation
recordlinkage copied to clipboard

A powerful and modular toolkit for record linkage and duplicate detection in Python

Results 63 recordlinkage issues
Sort by recently updated
recently updated
newest added

when .fit is called and a dataframe of candidate pairs is passed to it, the return type is a multiindex without any 0/1 coding of match.

I am doing Dedup in a single dataset of 1M size in the machine (M5.4xlarge 16 core and 64 GB RAM). I have done the following matching config, but it...

Hi, Maybe a dumb question, but how do you change the norm when you apply different comparison functions? In my case I'd like to change the LCS norm from "dice"...