scrapyrt icon indicating copy to clipboard operation
scrapyrt copied to clipboard

HTTP API for Scrapy spiders

Results 34 scrapyrt issues
Sort by recently updated
recently updated
newest added

Allow multiple urls crawling within a single API request. I need to be able to parse multiple url with a single request, so I've decided to separate them with a...

Hi, scrapyrt works very well for me, but I would like to use it with the autothrottle extension. Why is it disabled by default? Is there an easy way to...

bug

it will default to logging.DEBUG but users can set this to something higher in case they dont want all the noise. fixes #30 for future we can also add command...

enhancement

Collects not only spider exceptions, but also other kinds of errors that appear in logs. E.g.: ``` { "status": "ok", "errors": [ "Ignoring response : HTTP status code is not...

enhancement

We should add some command line argument to allow to customize logging, this would be useful for running scrapyrt in production but also good for apps that inherit from scrapyrt...

enhancement

I'm experiencing difficulties in accessing a ScrapyRT service running on specific ports within a Kubernetes pod. My setup includes a Kubernetes cluster with a pod running a Scrapy application, which...

more info needed

Also reflect Python 3.12 support; add some env vars default fallbacks for common CLI args; and fix a `os.path.join` to be properly cross-platform As for the `package` CLI arg, you...

Are there any specific features or bug fixes that you (users, developers, anyone) consider important, top priority? Please share information here in thread. If someone would like to volunteer to...

more info needed

As of January 2023, is this still maintained or is it safe to use in its current state?

Hi! thanks for your work on Scrapyrt! I've discovered that spiders served by Scrapyrt don't save the output in the Spider's / custom_settings / FEEDS. Is it possible to change...

more info needed