ExplainToMe icon indicating copy to clipboard operation
ExplainToMe copied to clipboard

RSS/Atom Scraper

Open jjangsangy opened this issue 7 years ago • 0 comments

Scrape and RSS/Atom Feeds

Other structured site content like

  • OpenGraph
  • HTML Element/Attribute data
  • Sitemap.xml

Current Content Extraction extracts using DOM parsing (Frequency based) and text heuristics (Goose).

Data can be used to generate entire feed info be used as context to direct algorithm selection in a command a pattern.

jjangsangy avatar Aug 17 '16 02:08 jjangsangy