dedupe topic

List dedupe repositories

duplicut

799
Stars
90
Forks
Watchers

Remove duplicates from MASSIVE wordlist, without sorting it (for dictionary-based password cracking)

mail-deduplicate

160
Stars
38
Forks
Watchers

📧 CLI to deduplicate mails from mail boxes.

recordlinkage

915
Stars
150
Forks
Watchers

A powerful and modular toolkit for record linkage and duplicate detection in Python

jdupes

1.7k
Stars
138
Forks
Watchers

A powerful duplicate file finder and an enhanced fork of 'fdupes'.

free-style

701
Stars
29
Forks
Watchers

Make CSS easier and more maintainable by using JavaScript

borg

10.6k
Stars
729
Forks
Watchers

Deduplicating archiver with compression and authenticated encryption.

restic

24.8k
Stars
1.5k
Forks
238
Watchers

Fast, secure, efficient backup program

dedupe

4.0k
Stars
540
Forks
Watchers

:id: A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.

zingg

902
Stars
109
Forks
Watchers

Scalable identity resolution, entity resolution, data mastering and deduplication using ML

yarn-deduplicate

1.4k
Stars
55
Forks
Watchers

Deduplication tool for yarn.lock files