pylinkvalidator icon indicating copy to clipboard operation
pylinkvalidator copied to clipboard

Invalid IPv6 URL

Open jimpriest opened this issue 10 years ago • 2 comments

When checking some URLs I get the following error:

error (<type 'exceptions.ValueError'>): Invalid IPv6 URL: 

Even though the URL is is not formatted unusually.

Scan http://verticalindustriesblog.redhat.com/ with depth=1 for some examples.

I may modify my fork to just ignore this error but I'm not sure there is a correct way to 'fix' it. Googling it seems like an issue with Python 2.7.x.

I see it both on 2.7.5 and 2.7.10.

jimpriest avatar Sep 22 '15 13:09 jimpriest

Hi Jim, just to be sure, are we talking about URLs such as http://[http//w.on24.com/r.htm?e=991027&s=1&k=DBEA8D7CD7CF38AE3A007AB5432DAC2B&partnerref=sapredhat found on this page: http://verticalindustriesblog.redhat.com/tune-in-red-hat-sap-and-tabb-group-discuss-high-performance-computing-its-growth-in-financial-services-and-its-shrinking-cost/

Trying this link on firefox raises a Server Not Found error (not even a 404). I guess, pylinkvalidator should report a sensible error when the URL is not parsable. Just want to make sure I'm not missing other cases.

bartdag avatar Sep 23 '15 00:09 bartdag

I think what is happening is the WYSIWG editor is trying to fix these links by adding an extra http// in the mix. Not sure where the bracket is coming from but they aren't IPv6 urls.

So yes, I think maybe just a more generic 'unparsable url found' error message may be more useful?

jimpriest avatar Sep 23 '15 13:09 jimpriest