scrapy-cluster icon indicating copy to clipboard operation
scrapy-cluster copied to clipboard

This Scrapy project uses Redis and Kafka to create a distributed on demand scraping cluster.

Results 17 scrapy-cluster issues
Sort by recently updated
recently updated
newest added

use ui mode exception stack info,but browser normal File "ui_service.py", line 121, in _kafka_stats r = requests.post(self.settings['REST_HOST'] + "/feed", json=data) File "/usr/local/lib/python2.7/site-packages/requests/api.py", line 110, in post return request('post', url, data=data,...

Great project, thanks for sharing - and supporting for so long! I ran into a few problems running the ELK stack - the Elasticsearch container kept restarting with `java.lang.IllegalStateException` [docker-elk-logs.txt](https://github.com/istresearch/scrapy-cluster/files/6015465/docker-elk-logs.txt)...

Hi. I don't know how much it happens or how much it already happenned but one of my crawl fell on the error below. I ran a thousand requests and...

Hi. I've just come across this project and it is exactly what we need. However, I've noticed there haven't been any updates for a while now. Could you guys please...

Issue to register the request to migrate fully to Python 3.

Adding a custom download middleware distributed_cookies.DistributedCookiesMiddleware that uses Redis to set and get the serialized cookie. Adding a custom spider middleware distributed_cookies.ClearCookiesMiddleware that delete cookie if an item with the...

We need a small stand-alone web UI that ties in with the rest components in #24 to visualize the data generated by the cluster. You should also be able to...

enhancement
help wanted
ui
feature request