phishing_catcher icon indicating copy to clipboard operation
phishing_catcher copied to clipboard

Check domains with special characters

Open serhack opened this issue 8 years ago • 5 comments

Hello, Your repo can help a lot of people, but you should check for domains that have special characters like the ỵ , ṙ . Let me know about updates!

serhack avatar Nov 07 '17 13:11 serhack

It definitely could

x0rz avatar Nov 07 '17 13:11 x0rz

The function score_domain can handle such domains as in python3 every strings are unicode.

Ge0 avatar Nov 07 '17 14:11 Ge0

Lookalike characters should definitely be scored as their normal counterparts in terms of looking at suspicious wording. Just need to translate punycode domains back to unicode, and determine which characters look like other ones.

The Unicode Consortium provides a list of confusables, and all that's needed from that are characters that are confusable with [a-zA-Z0-9\-]

caffeinewriter avatar Nov 13 '17 22:11 caffeinewriter

Could we close @x0rz ?

serhack avatar Sep 06 '18 16:09 serhack

Some issues with the current PR, will close as soon as it detects confusables

x0rz avatar Sep 07 '18 08:09 x0rz