Results 2 issues of James

**Work in progress...** A huge rewrite of the html module that follows the [HTML standard](https://html.spec.whatwg.org/multipage/parsing.html). The goal of this rewrite is to bring the html parsing closer to how browsers...

The `unescape_characters` method is slow, so the places it is called have been reduced. Ideally it's performance would be improved, but the html module is in the process of being...