Scrapinghub
Results
29 repositories owned by Scrapinghub
flatson
32 Stars, 7 Forks
Tool to flatten a stream of JSON-like objects, configured via a schema
mdr
110 Stars, 30 Forks
A Python library to detect and extract listing data from HTML pages.
page_clustering
35 Stars, 8 Forks
A simple algorithm for clustering web pages, suitable for crawlers
page_finder
30 Stars, 10 Forks
Find which links on a web page are pagination links