SafeText icon indicating copy to clipboard operation
SafeText copied to clipboard

Script to remove homoglyphs and zero-width characters to allow for safe distribution of documents from anonymous sources.

Results 3 SafeText issues
Sort by recently updated
recently updated
newest added

I see that you have a small list of homoglyphs in [characters_safetext.py](https://github.com/DavidJacobson/SafeText/blob/master/characters_safetext.py). Unicode has a reference text file for such information that seems pretty comprehensive: [confusables.txt](https://unicode.org/Public/security/latest/confusables.txt) ([Techincal Report](https://www.unicode.org/reports/tr39/#Confusable_Detection)) Would it...

Otherwise I can fingerprint on diacritic form, ligatures, etc. I don't know if it also removes the homoglyphs. Might want to look into that. NFKC does change the appearance of...

good first issue

Great project! I found a list of more non-printing characters from this Unicode document: http://www.unicode.org/charts/PDF/U2000.pdf Some of these characters seem like they could be included in the list of 'unsafe'...