robinson icon indicating copy to clipboard operation
robinson copied to clipboard

use html5lib instead of lxml

Open Ksengine opened this issue 4 years ago • 0 comments

can you remove lxml and use html5lib

html5lib is a pure-python library for parsing HTML. It is designed to conform to the WHATWG HTML specification, as is implemented by all major web browsers.

By default, the document will be an xml.etree element instance. Whenever possible, html5lib chooses the accelerated ElementTree implementation (i.e. xml.etree.cElementTree on Python 2.x).

Two other tree types are supported: xml.dom.minidom and lxml.etree. To use an alternative format, specify the name of a treebuilder:

Ksengine avatar Dec 07 '20 11:12 Ksengine