html5ever

High-performance browser-grade HTML5 parser

Results 52 html5ever issues

HTML parsers are not just used in client-side applications: they are also used on servers, such as in HTML sanitizers. html5ever (and xml5ever) should guarantee that they cannot be coerced...

For example, given HTML code like ``` ``` the function will be called when the parser reaches the end tag.

Two of the most common warnings I see reported in Servo's automatic reporting system are `stop_parsing not implemented, full speed ahead!` and `FIXME: /script> not fully implemented`. We should investigate...

We have to query this information from the DOM. Offshoot of #14. cc @SimonSapin

priority-low
XML

For example, there's [an old branch](https://github.com/servo/html5ever/tree/sse) to use SSE 4.2 string instructions in tokenization, which realized a speedup of about 10% and could probably be pushed much further.

performance

Similar to servo/servo#1009. The first step is speculative parsing concurrent with scripts, [similar to what Gecko does](https://developer.mozilla.org/en-US/docs/Mozilla/Gecko/HTML_parser_threading).

performance

A browser runs the [application cache selection algorithm](http://www.whatwg.org/specs/web-apps/current-work/multipage/browsers.html#concept-appcache-init) when the parser sees ``.

web-compat
priority-low

Many parts of the parser code closely match sections of the HTML5 syntax spec. We should include machine-readable comments describing this correspondence, and then generate an HTML document with syntax-highlighted...

doc

An automated test case reducer in the style of [C-Reduce](http://embed.cs.utah.edu/creduce/) would be a useful demo/application for this library.

examples
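
To make the test case reducer idea above concrete, here is a minimal sketch of greedy chunk removal, a simplified form of delta debugging. It is not C-Reduce's algorithm and it does not call any html5ever API: the function name `reduce` and the `is_interesting` predicate are hypothetical, introduced only for illustration. In a real tool the predicate would presumably feed each candidate to the parser and report whether the bug under investigation still reproduces.

```rust
/// Shrinks `input` by repeatedly deleting chunks of characters while the
/// caller-supplied predicate still holds for the shortened text.
///
/// `is_interesting` is a hypothetical predicate (an assumption of this
/// sketch): it should return true when a candidate still triggers the
/// behavior being investigated.
fn reduce<F>(input: &str, is_interesting: F) -> String
where
    F: Fn(&str) -> bool,
{
    let mut current: Vec<char> = input.chars().collect();
    // Start with large chunks and halve the chunk size after each pass.
    let mut chunk = (current.len() / 2).max(1);
    while chunk >= 1 {
        let mut start = 0;
        while start < current.len() {
            let end = (start + chunk).min(current.len());
            // Build a candidate with current[start..end] removed.
            let candidate: String = current[..start]
                .iter()
                .chain(current[end..].iter())
                .collect();
            if is_interesting(&candidate) {
                // Keep the smaller input. Do not advance `start`: the
                // following characters have shifted into this position.
                current = candidate.chars().collect();
            } else {
                start += chunk;
            }
        }
        chunk /= 2;
    }
    current.into_iter().collect()
}
```

For instance, calling `reduce("<p>hello <b>world</b></p>", |s| s.contains("<b>"))` shrinks the input to a string that still contains `<b>`. Because each chunk size gets only a single pass, the result is not guaranteed to be minimal in general; a production reducer would repeat passes until nothing more can be removed and would work on tokens or tree nodes rather than single characters.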

Part of https://github.com/servo/servo/issues/32454, from @jdm's [comment](https://github.com/servo/servo/issues/32454#issuecomment-2158242638).