scrapy-cluster issues

Adding support for issue #182

3

Adding support for 1.Custom Headers and Cookies with Initial request 2.Shared cookies middleware to share cookies between crawl nodes Linked Issue #182

knirbhay

Support for custom header and cookies for the initial request from kafka_monitor.py feed

5

I needed to request an URL with custom header and preset cookies. eg. There is an API at `https://xyz.com/test_api/_id` which returns a json. and this should be called with api...

knirbhay

crawler

feature request

Uitest

21

Added UI testing using selenium and python

backtrack-5

Checklist for items that I know need worked on before the [ui](https://github.com/istresearch/scrapy-cluster/tree/ui) branch can be merged into the [dev](https://github.com/istresearch/scrapy-cluster/tree/dev) branch - [x] Create documentation - [x] Add offline unit tests...

madisonb

help wanted

documentation

unit testing

ui

Circuit breaker design patterns

Lots of the individual components break down or crash when their required infrastructure is not available. They are dependent on kafka, redis, or zookeeper, but don't have good mechanisms always...

madisonb

crawler

kafka-monitor

redis-monitor

unit testing

Upgrade the project to python 3.10

4

Upgrade the project to python 3.10

borisjota

ERROR: Unable to connect to Kafka in Pipeline due to attempt to connect already-connected SSLSocket!, raising exit flag.

1

I ran the Scrapy Cluster spider start code and I ended up getting this error message, I have no idea what this could be and have troubleshooted for a while....

BeamoINT

scrapy-cluster
scrapy-cluster copied to clipboard

Metadata

Adding support for issue #182

Support for custom header and cookies for the initial request from kafka_monitor.py feed

Uitest

UI Integration Checklist

Circuit breaker design patterns

Upgrade the project to python 3.10

ERROR: Unable to connect to Kafka in Pipeline due to attempt to connect already-connected SSLSocket!, raising exit flag.

← Metadata

Owner

Metadata

scrapy-cluster scrapy-cluster copied to clipboard

Metadata

Adding support for issue #182

Support for custom header and cookies for the initial request from kafka_monitor.py feed

Uitest

UI Integration Checklist

Circuit breaker design patterns

Upgrade the project to python 3.10

ERROR: Unable to connect to Kafka in Pipeline due to attempt to connect already-connected SSLSocket!, raising exit flag.

← Metadata

Owner

Metadata

scrapy-cluster
scrapy-cluster copied to clipboard