autoextract-spiders

Better de-duplication of URLs

Open croqaz opened this issue 5 years ago • 0 comments

When URLs are discovered from different seeds, a URL that is found under more than one seed is not de-duplicated. There is local de-duplication during discovery, and there are also the built-in DupeFilters, but neither currently catches duplicates across seeds. This needs investigation.
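
To make the problem concrete, here is a minimal sketch of cross-seed de-duplication using only the Python standard library. It is not the project's code: `normalize`, `dedupe_across_seeds`, and the example URLs are hypothetical names chosen for illustration, and the normalization rules (lowercased host, dropped fragment) are an assumption about what should count as "the same URL".

```python
from __future__ import annotations

from urllib.parse import urlsplit, urlunsplit


def normalize(url: str) -> str:
    """Canonicalize a URL so trivially different spellings compare equal."""
    parts = urlsplit(url)
    # Lowercase scheme and host, drop the fragment, treat an empty path as "/".
    return urlunsplit((
        parts.scheme.lower(),
        parts.netloc.lower(),
        parts.path or "/",
        parts.query,
        "",
    ))


def dedupe_across_seeds(discovered: dict[str, list[str]]) -> dict[str, list[str]]:
    """Keep each URL only under the first seed that discovered it."""
    seen: set[str] = set()  # shared by all seeds, unlike a per-seed filter
    unique: dict[str, list[str]] = {}
    for seed, urls in discovered.items():
        unique[seed] = []
        for url in urls:
            key = normalize(url)
            if key not in seen:
                seen.add(key)
                unique[seed].append(url)
    return unique


# Hypothetical input: two seeds that both link to /a.
discovered = {
    "https://example.com/news": [
        "https://example.com/a",
        "https://example.com/b",
    ],
    "https://example.com/blog": [
        "https://EXAMPLE.com/a#top",
        "https://example.com/c",
    ],
}
result = dedupe_across_seeds(discovered)
```

The key point is that the `seen` set lives outside the per-seed loop, so a URL first found under one seed is skipped when a later seed finds it again. Whether the same effect is better achieved by sharing state in the discovery step or by relying on the built-in DupeFilters is exactly what would need investigating.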

croqaz • Nov 06 '19 12:11