webcrawler topic

List webcrawler repositories

coronavirus

61
Stars
8
Forks

Real-time 2019-nCoV tracking system based on Scrapy, InfluxDB, Grafana, NLTK, and Stanford CoreNLP

gafanhoto

50
Stars
6
Forks

Bot for monitoring deals on the Hardmob forum: http://www.hardmob.com.br/promocoes/

crawlab

10.9k
Stars
1.7k
Forks

Distributed web crawler admin platform for managing spiders, regardless of language or framework

spider-flow

9.2k
Stars
1.8k
Forks

A next-generation crawler platform that defines crawl workflows graphically, so crawlers can be built without writing code.

scrapyrt

817
Stars
160
Forks

HTTP API for Scrapy spiders

crawler-china-mainland-universities

165
Stars
51
Forks

Crawler for a list of universities in mainland China

skycaiji

1.9k
Stars
571
Forks

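SkyCaiji (蓝天采集器) is a free, open-source crawler system. Data is collected simply by pointing, clicking, and editing rules; it runs locally, on shared hosting, or on cloud servers; it can collect almost every type of web page; it integrates seamlessly with various CMS site builders; it publishes data in real time without logging in; and it is fully automatic, with no manual intervention needed! It is a web page...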

php-crawler

134
Stars
65
Forks

A PHP crawler that finds email addresses on the internet

opensearchserver

499
Stars
190
Forks

Open-source, enterprise-grade search engine software