Georgiy Zatserklianyi
This PR is ready for review. Tests for per-slot settings are implemented using the mockserver.
I suppose it can be implemented by updating code here: https://github.com/scrapy/scrapy/blob/23537a0f9580bfb28ac5d8b88f37df47e838f463/scrapy/core/downloader/handlers/__init__.py#L70-L75
@Gallaecio > It would be great if a plugin like https://github.com/scrapy-plugins/scrapy-playwright did not have to force you to drive all requests through its download handlers, and instead you could drive...
@Gallaecio @Duckweeds7 I have some concerns about it > or crawls a large number of tasks, it will lead to info.downloaded becoming very large If the total size of all downloaded...
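The unbounded-growth concern above can be illustrated with a size-capped mapping. This is a hypothetical sketch, not Scrapy's actual `info.downloaded` structure; the class name and eviction policy are my own choices for illustration:

```python
from collections import OrderedDict

class BoundedCache(OrderedDict):
    """Dict that evicts its oldest entry once maxsize is exceeded (LRU-style sketch)."""

    def __init__(self, maxsize=10_000):
        super().__init__()
        self.maxsize = maxsize

    def __setitem__(self, key, value):
        if key in self:
            self.move_to_end(key)  # refresh recency on overwrite
        super().__setitem__(key, value)
        if len(self) > self.maxsize:
            self.popitem(last=False)  # drop the least recently inserted entry

cache = BoundedCache(maxsize=3)
for i in range(5):
    cache[i] = i
# only the 3 most recent keys survive
assert list(cache) == [2, 3, 4]
```

A cap like this trades completeness for bounded memory, which is exactly the trade-off being debated: a long crawl could re-download items whose entries were evicted.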
@ejulio, @Gallaecio The `ScrapyAgent._cb_bodydone` method chooses the response class: https://github.com/scrapy/scrapy/blob/e22a8c8c36e34ffaf12ef9e330624df654582605/scrapy/core/downloader/handlers/http11.py#L395-L400 The `responsetypes.from_args` method performs several checks with the following logic: - the default response type is plain `Response`, - if Scrapy identifies something that requires...
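The fall-through priority described above can be sketched as a simplified stand-alone function. This is an illustration of the selection logic only, not Scrapy's actual `responsetypes` implementation; the function name and the exact checks are assumptions:

```python
import mimetypes

def pick_response_class(content_type=None, url=None, body=None):
    """Return a response class name using the first signal that matches."""
    if content_type:
        if "html" in content_type:
            return "HtmlResponse"
        if "text" in content_type:
            return "TextResponse"
        return "Response"
    if url:
        guessed, _ = mimetypes.guess_type(url)
        if guessed:
            return pick_response_class(content_type=guessed)
    if body is not None and body.lstrip().startswith(b"<"):
        return "HtmlResponse"  # crude body sniffing
    return "Response"  # default: plain Response

assert pick_response_class(url="https://example.com/page.html") == "HtmlResponse"
assert pick_response_class(body=b"\x00\x01binary") == "Response"
```

The key property is the ordering: an explicit `Content-Type` header wins over the URL extension, which wins over body sniffing, and plain `Response` is the fallback when nothing matches.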
> Imagine those sites only support low traffic, so you want to limit concurrency to 2 per site. Also imagine that your Splash instance can only handle up to 3...
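Assuming the per-slot settings feature this PR discusses, the scenario above (limit each site to 2, the Splash instance to 3) might be expressed as a settings fragment like the following sketch; the slot names are made up for illustration:

```python
# Hypothetical settings fragment: per-slot concurrency limits.
DOWNLOAD_SLOTS = {
    "splash-instance": {"concurrency": 3},      # the shared Splash service
    "site-a.example": {"concurrency": 2},       # low-traffic site A
    "site-b.example": {"concurrency": 2},       # low-traffic site B
}
```

Without per-slot settings, a single global `CONCURRENT_REQUESTS_PER_DOMAIN` cannot express both limits at once, which is the point of the example in the quote.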
Let's test this script with various settings:

```python
import scrapy
from scrapy.crawler import CrawlerProcess

class BooksToScrapeSpider(scrapy.Spider):
    name = "books"
    start_urls = [f"https://books.toscrape.com/catalogue/page-{i}.html" for i in range(1, 32)]
    custom_settings = {"DOWNLOAD_DELAY": 1}
    ...
```
@Gallaecio Created new test cases for checking selectors with `bytes` input.
@pawelmhm > So just removing this single line gives us 20% improvement in memory usage. I think the real impact of this is much more than 20%. At this moment...
Hello @kmike. Thank you for the feedback. > 1. We're talking about peak memory usage. get_virtual_size is not returning the amount of currently allocated memory; it returns the maximum which the process used...
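The peak-versus-current distinction quoted above can be demonstrated with the stdlib `tracemalloc` module (illustrative only; the `get_virtual_size` helper in Scrapy's tests reads an OS-level high-water mark, which behaves the same way):

```python
import tracemalloc

tracemalloc.start()
blob = [bytearray(1_000_000) for _ in range(50)]  # allocate ~50 MB
del blob  # release it again
current, peak = tracemalloc.get_traced_memory()
tracemalloc.stop()

# Peak reflects the high-water mark, not what is allocated right now,
# so it stays large even after the allocation has been freed.
assert peak > 50 * 1_000_000 > current
```

This is why a measurement based on peak usage cannot shrink during a run: it can only record the worst moment so far.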