dns-blocklists icon indicating copy to clipboard operation
dns-blocklists copied to clipboard

Using CWF API to Bulk Classify Domains

Open sr093906 opened this issue 2 years ago • 2 comments

I haven't tried it myself. So, I cannot ensure the viability.Maybe you can have a try.

https://cwf.comodo.com/subscriptions.php

The available categories are listed here https://cwf.comodo.com/categories.php

Find common entries between pro,plus version and top 1M.

Then, removing malicious ones identified by Google Safe browsing API (https://github.com/elliotwutingfeng/Inversion-DNSBL-Blocklists/blob/main/Google_hostnames.txt?raw=true) and so-called NSFW ones in https://oisd.nl/downloads.

Removing confirmed ad and tracking entries based on some sources you trust.

After that, filtering out domains with keywords such as 'sex', 'porn','adv', 'click', 'bet', "casino' and others to further reduce the amount.

And finally, using the free API to categorize the remainder.

I believe the number shall be less than 20000 ones.

sr093906 avatar Oct 10 '22 03:10 sr093906

Thanks for the suggestion. I'll take a closer look.

hagezi avatar Oct 10 '22 06:10 hagezi

Thanks for reply. Hoping it will be helpful.

sr093906 avatar Oct 10 '22 11:10 sr093906