article-extracting topic
ftr-site-config
Site-specific article extraction rules to aid content extractors, feed readers, and 'read later' applications.
NewsCatchr
FOSS Android News Reader App
SmartReader
SmartReader is a library to extract the main content of a web page, based on a port of the Readability library by Mozilla
markdown_articles_tool
Parse markdown article, download images and replace images URL's with local paths
article-parser
Extract article or news by url or html, parse the title and content, output in markdown format.
newshound
This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around the world in over 50 languages.
html-article-extractor
A web page content extractor
dnlp
π Π‘Π±ΠΎΡΠ½ΠΈΠΊ ΠΏΠΎΠ»Π΅Π·Π½ΡΡ ΡΡΡΠΊ ΠΈΠ· Natural Language Processing: ΠΠΏΡΠ΅Π΄Π΅Π»Π΅Π½ΠΈΠ΅ ΡΠ·ΡΠΊΠ° ΡΠ΅ΠΊΡΡΠ°, Π Π°Π·Π΄Π΅Π»Π΅Π½ΠΈΠ΅ ΡΠ΅ΠΊΡΡΠ° Π½Π° ΠΏΡΠ΅Π΄Π»ΠΎΠΆΠ΅Π½ΠΈΡ, ΠΠΎΠ»ΡΡΠ΅Π½ΠΈΠ΅ ΠΎΡΠ½ΠΎΠ²Π½ΠΎΠ³ΠΎ ΡΠΎΠ΄Π΅ΡΠΆΠΈΠΌΠΎΠ³ΠΎ ΠΈΠ· html Π΄ΠΎΠΊΡΠΌΠ΅Π½ΡΠ°