Support HTML Parsing
This adds support for HTML parsing.
Great idea @amenk
@amenk You can try to fix formatting problems with:
composer fix-lint
Also try:
composer refactor && composer test
@f-lombardo thanks, currently not working on this
Will hopefully pick it up later.
@amenk @f-lombardo what about just using the WebPageTextGetter class in Tool? Getting the text from the HTML should be quite enough, no? (even if quite messy ^^)
Well, it's an option, even if we would have to change it a bit so that it can also parse HTML coming from a file. Another thing to consider is whether LLPhant should handle the parsing of complex HTML pages by itself or delegate that to an external specialized library.
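To make the "handle it ourselves" option concrete, here is a minimal sketch using only PHP's built-in DOM extension. The helpers `htmlToText` and `htmlFileToText` are hypothetical names for illustration; they are not part of LLPhant and do not reflect how WebPageTextGetter is implemented. The sketch only shows that the same extraction could serve both an HTML string and an HTML file.

```php
<?php

declare(strict_types=1);

/**
 * Hypothetical helper (not an LLPhant API): returns the visible text of an HTML string.
 */
function htmlToText(string $html): string
{
    $dom = new DOMDocument();

    // Real-world HTML is often malformed; collect libxml warnings instead of emitting them.
    $previous = libxml_use_internal_errors(true);
    // The XML declaration prefix is a common trick to make loadHTML() treat the input as UTF-8.
    $dom->loadHTML('<?xml encoding="UTF-8">' . $html);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    // Drop elements whose content is not meant to be read as text.
    foreach (['script', 'style'] as $tag) {
        // Copy the live node list first so removal does not disturb iteration.
        foreach (iterator_to_array($dom->getElementsByTagName($tag)) as $node) {
            $node->parentNode->removeChild($node);
        }
    }

    $root = $dom->documentElement;
    if ($root === null) {
        return '';
    }

    // Collapse runs of whitespace into single spaces.
    $text = $root->textContent;
    $collapsed = preg_replace('/\s+/u', ' ', $text) ?? $text;

    return trim($collapsed);
}

/**
 * Hypothetical helper (not an LLPhant API): same extraction, for HTML stored in a file.
 */
function htmlFileToText(string $path): string
{
    $html = @file_get_contents($path);
    if ($html === false) {
        throw new RuntimeException("Cannot read HTML file: {$path}");
    }

    return htmlToText($html);
}

$sample = "<html><head><style>p{}</style></head><body><h1>Title</h1>\n"
    . "<p>Hello <b>world</b></p><script>alert(1)</script></body></html>";

echo htmlToText($sample), PHP_EOL; // prints "Title Hello world"
```

This approach is deliberately naive: adjacent block elements with no whitespace between them run together, and navigation or boilerplate text is kept. That limitation is the argument for delegating complex pages to a specialized library.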