Sitemap-Generator-Crawler

Allow validating non-ASCII URLs (fixes #57)

Open francisek opened this issue 7 years ago • 3 comments

Add a function, url_to_ascii, that converts a UTF-8 URL to a plain ASCII one.

francisek avatar Oct 07 '17 12:10 francisek

This covers domains but not URL paths.

Halfway there.

vezaynk avatar Oct 07 '17 12:10 vezaynk
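To illustrate the path half of the problem raised above: a minimal sketch, assuming the path arrives as a UTF-8 string and that percent-encoding its non-ASCII bytes is what is wanted. The helper name encode_path_to_ascii is hypothetical and is not part of the pull request.

```php
<?php
// Hypothetical helper (not from the pull request): percent-encode the
// non-ASCII bytes of a URL path and leave every ASCII character as-is.
function encode_path_to_ascii(string $path): string
{
    // The pattern runs byte-wise (no /u flag), so each byte of a
    // multi-byte UTF-8 sequence is matched and encoded on its own.
    // ASCII characters, including "/" and existing "%XX" escapes, are kept.
    return preg_replace_callback(
        '/[\x80-\xFF]/',
        static function (array $m): string {
            return rawurlencode($m[0]);
        },
        $path
    );
}

// "\u{e9}" is é and "\u{fc}" is ü, written as escapes so the example
// does not depend on the encoding of this source file.
echo encode_path_to_ascii("/caf\u{e9}/men\u{fc}.html"), "\n";
// prints: /caf%C3%A9/men%C3%BC.html
```

Only bytes above 0x7F are touched, so a path that is already percent-encoded passes through unchanged. Spaces and other reserved ASCII characters are deliberately left alone in this sketch.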

I'm trying to test this, but idn_to_ascii isn't available on my system. The function appears in the official docs with no mention of needing an extension, and I have a fairly standard install, so this is strange.

This cannot be merged without a shim.

vezaynk avatar Oct 08 '17 17:10 vezaynk

idn_to_ascii is provided by the intl extension. We could use a pure PHP shim built on an external library like https://github.com/phlylabs/idna-convert, as the conversion rules are a bit complicated.

francisek avatar Oct 24 '17 07:10 francisek
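One possible shape for such a shim is sketched below, assuming the goal is simply to avoid a fatal error when the intl extension is missing. The wrapper name host_to_ascii is hypothetical, and the fallback branch is left as a placeholder because the external library's API is not shown in this thread.

```php
<?php
// Hypothetical wrapper (the name is not from the pull request): convert a
// host name to ASCII, using intl's idn_to_ascii() only when it exists.
function host_to_ascii(string $host): ?string
{
    // Hosts without any non-ASCII byte need no conversion at all.
    if (!preg_match('/[\x80-\xFF]/', $host)) {
        return $host;
    }

    // idn_to_ascii() and its constants are defined only when the intl
    // extension is loaded, so they are referenced inside this guard.
    if (function_exists('idn_to_ascii')) {
        $ascii = idn_to_ascii($host, IDNA_DEFAULT, INTL_IDNA_VARIANT_UTS46);
        return $ascii === false ? null : $ascii;
    }

    // Assumption: a pure-PHP converter would be called here. Until one
    // is wired in, report failure instead of returning an unconverted host.
    return null;
}

var_dump(host_to_ascii('example.com'));
```

Returning null rather than the original host lets the caller decide whether to skip the URL or report it, which seems safer than silently emitting a non-ASCII host.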