urlcanon
urlcanon copied to clipboard
Percent encoded non-UTF8 sequences (Java version)
SemanticPrecise changes "%C3%23" in the query string to "%ef%bf%bd%23" which is different from both python urlcanon ("%c3%23") and webarchive-commons ("%25c3%23").
See also: https://github.com/iipc/webarchive-commons/pull/102