article-extractor topic
trafilatura
Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
article-extractor
Extract the main article from a given URL with Node.js
php-goose
Readability / HTML Content / Article Extractor & Web Scraping library written in PHP
SmartReader
SmartReader is a library to extract the main content of a web page, based on a port of the Readability library by Mozilla
markdown_articles_tool
Parse a Markdown article, download its images, and replace the image URLs with local paths
nlpserver
NLP Web Service
sneakpeek
Reddit bot to preview and post hyperlinks as comments
newshound
This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around the world in over 50 languages.
laravel-nlp
Laravel wrapper for common NLP tasks