newsworker icon indicating copy to clipboard operation
newsworker copied to clipboard

Advanced news feeds extractor and finder library. Helps to automatically extract news from websites without RSS/ATOM feeds

Results 16 newsworker issues
Sort by recently updated
recently updated
newest added

Add page analysis command. It should be `feedcmd analysis ` with output of possible feeds on the page and feed types and example feed entities

enhancement

There is no tests in this projects. Add it!

enhancement

Local files date extraction should be supported too. Required to write proper tests

enhancement

Instead of dynamic page structure identification generate a template with a number of options that should simplify data parsing afterward. It should include: - location of the container tag -...

enhancement

The current rule is to use the first link by default. It doesn't work well. Example URL http://pskenergo.ru/news/branch/ instead of a post URL, each time a category URL is detected....

bug

URL https://inspire.ec.europa.eu/news Example: `Monday, January 31, 2022` Need to update qddate patterns

bug