linkchecker
linkchecker copied to clipboard
ascii-error with HtmlSyntaxCheck
Hi,
i have a config containing "[HtmlSyntaxCheck]". But i just keep getting an ascii error...
~>linkchecker -f config/01_linkcheck -Fhtml/reports/linkchecker/test.html www.google.de
INFO 2015-06-11 16:08:05,824 MainThread Checking intern URLs only; use --check-extern to check extern URLs.
LinkChecker 9.3 Copyright (C) 2000-2014 Bastian Kleineidam
LinkChecker comes with ABSOLUTELY NO WARRANTY!
This is free software, and you are welcome to redistribute it
under certain conditions. Look at the file `LICENSE' within this
distribution.
Die neueste Version gibt es unter http://wummel.github.io/linkchecker/
Schreiben Sie Kommentare und Fehler an https://github.com/wummel/linkchecker/issues
Unterstütze dieses Projekt unter http://wummel.github.io/linkchecker/donations.html
Beginne Prüfen am 2015-06-11 16:08:05+002
1 thread active, 0 links queued, 0 links in 0 URLs checked, Laufzeit 1 Sekunden
WARNUNG 2015-06-11 16:08:07,799 CheckThread-http://www.google.de HTML syntax check plugin error: 'ascii' codec can't encode character u'\u2026' in position 1427: ordinal not in range(128)
Does anyone know how to fix this?
regards, strowi
This still an issue in LinkChecker 9.3, Debian Jessie.
Note that the following returns no results (i.e. the page does not contain non-ASCII characters).
wget http://www.google.de
grep -P -n "[^[:ascii:]]" index.html
Thank you for the issue report. Sadly this project is dead, and a new team is around with https://github.com/linkcheck/linkchecker for more details please see: #708 Also please close this issue and report it freshly on the new repo https://github.com/linkcheck/linkchecker/issues if your issue still persists