yarl icon indicating copy to clipboard operation
yarl copied to clipboard

Over-zealous empty query/fragment normalization like bpo-37969

Open nicktimko opened this issue 4 years ago • 0 comments

I saw https://bugs.python.org/issue37969 and was curious how YARL handles an empty querystring given that RFC 3986 § 6.2.3 says

Normalization should not remove delimiters when their associated component is empty unless licensed to do so by the scheme specification. For example, the URI "http://example.com/?" cannot be assumed to be equivalent to any of the examples ["http://example.com", "http://example.com/", "http://example.com:/", "http://example.com:80/"].

† Is there a separate spec for HTTP that says you can? Would this impact a generic URL parser anyways?

>>> yarl.URL("http://example.com/?#") == yarl.URL("http://example.com/")
True

And I can't see a way to yarl.URL.build to end up with a blank (but delimited) querystring/fragment

Wondering if this would just be a 'won't-fix' in YARL, or if there's some clean design solution. Also if any URL-savants wanted to render an opinion in the bugs.python.org issue.


⁂ even more niche "issue":

>>> yarl.URL("http://example.com:80/") == yarl.URL("http://example.com:/")
False

nicktimko avatar Sep 03 '19 19:09 nicktimko