Paul Tremberth
This needs to be moved to [`w3lib`](https://github.com/scrapy/w3lib/issues), the new home of `canonicalize_url`.
The offending unquoting [happens in `w3lib.url._unquotepath`](https://github.com/scrapy/w3lib/blob/f46b4c4140dfd0081d34327d91f496f7a221bed7/w3lib/url.py#L502). It only considers `/` and `?`.
What I commented earlier is not really relevant. The issue is not so much that `%23` gets unquoted, but that `#` is not percent-encoded again afterwards when the URI is rebuilt. (And maybe `?` should...
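To make the failure mode concrete, here is a minimal sketch using only the standard library. It is not w3lib's implementation: the two helper functions are hypothetical and only mimic an "unquote, then re-quote" round trip, once treating `#` as safe and once not.

```python
from urllib.parse import quote, unquote, urlsplit


def requote_keeping_hash(path):
    """Hypothetical helper: unquote the path, then re-quote it while
    treating '#' as safe, so a decoded '%23' is never re-encoded."""
    return quote(unquote(path), safe="/#")


def requote_encoding_hash(path):
    """Hypothetical helper: same round trip, but '#' is not in the safe
    set, so it is percent-encoded again when the path is rebuilt."""
    return quote(unquote(path), safe="/")


original_path = "/a%23b/c"

naive = requote_keeping_hash(original_path)
careful = requote_encoding_hash(original_path)

# With the naive round trip, the literal '#' now starts a fragment,
# so the path seen by a URL parser is cut short.
naive_parts = urlsplit("http://example.com" + naive)
careful_parts = urlsplit("http://example.com" + careful)

print(naive, naive_parts.path, naive_parts.fragment)
print(careful, careful_parts.path, careful_parts.fragment)
```

In the first case the parser sees a shorter path plus a fragment, which is a different resource from the one the original URL pointed to; in the second case the path survives the round trip unchanged.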
@HyperHCl, I'm not sure this is the right place to report this decoding issue. Have you submitted the issue to the Python core developers?
@HyperHCl, I see. But where does this fit into w3lib?
Can you add tests for this? Can you provide example websites showing this issue?
@denity , if you're referring to : ``` 2017-05-18 11:25:57 [twisted] CRITICAL: Unhandled Error Traceback (most recent call last): File "/home/paul/.virtualenvs/scrapy-jsonrpc.py2/local/lib/python2.7/site-packages/twisted/protocols/basic.py", line 571, in dataReceived why = self.lineReceived(line) File "/home/paul/.virtualenvs/scrapy-jsonrpc.py2/local/lib/python2.7/site-packages/twisted/web/http.py",...
I don't understand the issue or the behavior you want documented. Can you explain what's happening with a timeline?
Hello @mrueegg, sorry it took so long, but I had a look at this again this morning, and I think I understand the issue now. I'm a bit slow...
63 videos as of today 2018-12-13