news-crawler topic
trafilatura
Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
news-please
news-please - an integrated web crawler and information extractor for news that just works
KoreaNewsCrawler
A korean news crawler built to ingest large amounts of news data.
news-crawler
A news crawler for BBC News, Reuters and New York Times.
google-news-scraper
Lightweight scraper for Google News
newshound
This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around the world in over 50 languages.
news-sentiment-analysis
The spider crawls moneycontrol.com and economictimes.com to fetch news of input companies and also scores and classifies the companies to raise an early warning signal
NewsFeeds
Newsfeeds website using nodejs as server and mongo as storage backends, including a simple recommendation system. 基于Node.js的新闻聚合网站, 支持基于用户行为推荐新闻.
fundus
A very simple news crawler with a funny name