AdvancedHTMLParser
AdvancedHTMLParser copied to clipboard
Fast Indexed python HTML parser which builds a DOM node tree, providing common getElementsBy* functions for scraping, testing, modification, and formatting. Also XPath.
First, great job. Really like AdvancedHTMLParser. Second, a request or two: on the xpath front, is it possible to add support for the `count()` and `concat()` methods? Thanks.
Hello, thank you for this good project. is it possible to parse an html that is inside javascript ? i tried to `getElementsByClassName('nc_ensemble')` but the TagCollection was empty ```js var...
Hey Tim, it's Tim! I thought a project I built recently, might be useful to better showcase your project: [https://github.com/timothycrosley/portray](https://github.com/timothycrosley/portray) Switches to using portray and github pages to host your...
Hi! Nice library. I have only one small problem. When i try print tag which contain attribute with None value, print fail. Here is simple code to reproduce that: ```...
Hi there! I was trying to fuzz your interesting library as part of my university testing task, when I encountered interesting detail - there is possibility to open debugger at...
**Desctiption**: Getting an `AttributeError` when passing an html-like string with a corrupted `` tag in the `AdvancedHTMLParser.AdvancedHTMLParser().parseStr` method. **String input:** ``` W33ZpsIOCysn9GGU45y0LW9EpuPHBlAuxCRRusKRvowefQLMy2