byteface
byteface
the possibility for this has opened up a lot more with the html5lib integration. As the parser now provides a great opportunity to write the cssom parsers for live pages...
see release notes for 0.8.2 . separating them out and populating first models of the cssom.
hmmm, been modding expatbuilder and seems to have worked. a decent parseString could be coming, quite soon. can you feel the excitement?.
some links... https://www.tutorialspoint.com/python3/python_xml_processing.htm https://www.computerhope.com/unix/pylibml.htm
this looks exciting... https://github.com/byteface/html5-parser/blob/master/src/html5_parser/dom.py given what i just did with expat. may be able to mod that to generate domonic from huge sites?
I managed to mod the file. easier that I thought... https://github.com/byteface/html5-parser/commit/fa83bf1a5e225f6934a1ad38d092fc6baf3c4934 so that appears to work. even with lots of websites. It seems to build trees with domonic. ``` import...
So the options are to patch that file after each install. or pip install git+https://path to my patched version i need to figure out that path and test. again. But...
https://html5-parser.readthedocs.io/en/latest/
Hi @ipfans , thanks for feedback. There is Not yet a perfect way as I originally only set out to generate html. But it IS on the roadmap. Some more...
to explain maybe a little deeper. and future progress. As parser stuff is undocumented. domonic orignally had a simple regex parser, for tags only no content. which grew. domonic currently...