news-crawler topic

List news-crawler repositories

trafilatura

3.0k
Stars
228
Forks
Watchers

Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments

news-please

2.0k
Stars
405
Forks
Watchers

news-please - an integrated web crawler and information extractor for news that just works

KoreaNewsCrawler

217
Stars
104
Forks
Watchers

A korean news crawler built to ingest large amounts of news data.

news-crawler

103
Stars
42
Forks
Watchers

A news crawler for BBC News, Reuters and New York Times.

newshound

29
Stars
3
Forks
Watchers

This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around the world in over 50 languages.

news-sentiment-analysis

20
Stars
5
Forks
Watchers

The spider crawls moneycontrol.com and economictimes.com to fetch news of input companies and also scores and classifies the companies to raise an early warning signal

NewsFeeds

21
Stars
11
Forks
Watchers

Newsfeeds website using nodejs as server and mongo as storage backends, including a simple recommendation system. 基于Node.js的新闻聚合网站, 支持基于用户行为推荐新闻.

fundus

122
Stars
62
Forks
Watchers

A very simple news crawler with a funny name