Scrapinghub

Results 29 repositories owned by Scrapinghub

learn.scrapinghub.com

55
Stars
24
Forks
Watchers

Scrapinghub Learning Center. Report issues in Jira: Report issues in Jira: https://scrapinghub.atlassian.net/projects/WEB

web-poet

90
Stars
14
Forks
Watchers

Web scraping Page Objects core library

aduana

53
Stars
8
Forks
Watchers

Frontera backend to guide a crawl using PageRank, HITS or other ranking algorithms based on the link structure of the web graph, even when making big crawls (one billion pages).

aile

88
Stars
16
Forks
Watchers

Automatic Item List Extraction

autoextract-spiders

20
Stars
15
Forks
Watchers

Pre-built Scrapy spiders for AutoExtract

crawlera-tools

26
Stars
10
Forks
Watchers

Crawlera tools

docker-devpi

28
Stars
44
Forks
Watchers

pypi caching service using devpi and docker

docker-images

32
Stars
8
Forks
Watchers

exporters

40
Stars
10
Forks
Watchers

Exporters is an extensible export pipeline library that supports filter, transform and several sources and destinations