Scrapinghub

Results 29 repositories owned by Scrapinghub

scrapyrt

817
Stars
160
Forks
Watchers

HTTP API for Scrapy spiders

extruct

828
Stars
114
Forks
Watchers

Extract embedded metadata from HTML markup

spidermon

514
Stars
92
Forks
Watchers

Scrapy Extension for monitoring spiders execution.

python-crfsuite

769
Stars
222
Forks
Watchers

A python binding for crfsuite

webstruct

254
Stars
59
Forks
Watchers

NER toolkit for HTML data

scrapy-training

170
Stars
46
Forks
Watchers

Scrapy Training companion code

adblockparser

189
Stars
29
Forks
Watchers

Python parser for Adblock Plus filters

article-extraction-benchmark

236
Stars
28
Forks
Watchers

Article extraction benchmark: dataset and evaluation scripts

dateparser

2.4k
Stars
461
Forks
Watchers

python parser for human readable dates

js2xml

185
Stars
23
Forks
Watchers

Convert Javascript code to an XML document