html-parsing topic
interweave
🌀 React library to safely render HTML, filter attributes, autowrap text with matchers, render emoji characters, and much more.
Fuzi
A fast & lightweight XML & HTML parser in Swift with XPath & CSS support
htmldate
Fast and robust date extraction from web pages, with Python or on the command-line
jusText
Heuristic based boilerplate removal tool
parse5
HTML parsing/serialization toolset for Node.js. WHATWG HTML Living Standard (aka HTML5)-compliant.
goquery
A little like that j-thing, only in Go.
breadability
Reworked https://www.readability.com/ parsing library (now https://mercury.postlight.com/ is living alternative)
scala-scraper
A Scala library for scraping content from HTML pages
XML-Parser
A Node.js XML DOM, Parser & Stringifier.
HTMLp
Delphi Dom HTML Parser and Converter. Fork (not from the original author): https://sourceforge.net/projects/htmlp/