Josef Procházka
Josef Procházka
Currently if we use crawlee to create python Project. `crawlee create` and then within this project we call `apify init` we get surprising warning: `Warning: The current directory does not...
### Description Add `validate_input` endpoint ### Issues - Closes: #151
_ActorType aliased as Actor is very big class with many methods. https://github.com/apify/apify-sdk-python/blob/master/src/apify/_actor.py#L42 It is entry point of SDK and it would be nice if it was nicely structured. Having too...
Do not have src fodler in the templates, but rather use package name like it is done in crawlee templates. Will be done instead of this: https://github.com/apify/crawlee-python/issues/1113 after [cli issue](https://github.com/apify/apify-cli/issues/746)...
In situation where scrapy template explodes and produces no results, template test passes. It is because test framework checks only for [return code of template script](https://github.com/apify/actor-templates/blob/master/test/templates.test.js#L29) and the scrapy code...
Hi, when testing FingerprintGenerator I have seen that it can fail to generate fingerprints with error: "**ValueError: Cannot generate headers. User-Agent may be invalid, or screen constraints are too restrictive.**"...
### Description Use `Unique Set Size (USS)` to estimate children process memory usage to avoid overestimation of used memory due to shared memory being counted multiple times when using `Resident...
## Problem: client_info=1 of autoscaled pool can completely stop the crawler, without any recovery. Investigate and fix if not desired. ## Context: A benchmark test was running for Parsel crawler....
`RequestQueue` can log following line even in situation where there is only one client and thus no other client exists: `The queue still contains requests locked by another client` For...