discord-urls-extractor
discord-urls-extractor copied to clipboard
Some URLs contain unprintable characters
Preferably, the regex should be able to catch that, but alternatively, we could just sanitise the data (with a command-line option to also print the bad one, in case it was intentional, and a command-line option to not sanitise it at all)
List of broken URLs from my Morbius grab:
https://transfer.archivete.am/nM4yy/discord-morbius-outdated.not-printable.txt
(Open it up in a hex editor to see the problem.)