Dedupe.io

Results 9 repositories owned by Dedupe.io

dedupe

4.0k
Stars
540
Forks
Watchers

:id: A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.

csvdedupe

404
Stars
83
Forks
Watchers

:id: Command line tool for deduplicating CSV files

dedupe-examples

394
Stars
216
Forks
Watchers

:id: Examples for using the dedupe library

affinegap

58
Stars
9
Forks
Watchers

:triangular_ruler: A Cython implementation of the affine gap string distance

address-matching

58
Stars
19
Forks
Watchers

Python script for matching a list of messy addresses against a gazetteer using dedupe.

dedupe-geocoder

17
Stars
6
Forks
Watchers

:round_pushpin: Demonstration of how dedupe might be used as geocoder

hcluster

35
Stars
20
Forks
Watchers

Hierarchical Clustering Algorithms

pyhacrf

24
Stars
12
Forks
Watchers

:triangular_ruler: Hidden alignment conditional random field for classifying string pairs.

pylbfgs

24
Stars
17
Forks
Watchers

:mountain_cableway: Python/Cython wrapper for liblbfgs