Jindřich Bär issues

Results: 39 issues of Jindřich Bär

Better KVS streaming support

### Which package is the feature request for? If unsure which one to select, leave blank None ### Feature While the current KVS implementation can work with Node.js streams (e.g....

feature

t-tooling

feat: creating multiple unnamed queues

There are often reasons to make multiple separate RQs in one Crawlee project (e.g., having `CheerioCrawler` for processing most of the pages and a separate keep-alive `PlaywrightCrawler` instance for processing...

feature

t-tooling

`async` function not awaited in main world execution mode

When evaluating an asynchronous function in the main world execution mode, the return value is always `{ }`. Example: ```python from camoufox.sync_api import Camoufox with Camoufox( main_world_eval=True ) as camoufox:...

bug

feat: use exclusively the cookies from `Session`

Makes the current `Session` instance the single source of cookies for the current request. Closes #2744

fix: certain redirect responses change request methods

Aligns the `@crawlee/impit-client` implementation with the respective RFC and browsers' behaviour. Closes #2586

adhoc

t-tooling
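The redirect fix above concerns how a client picks the request method when it follows a redirect. The following is a minimal sketch of that decision, assuming the RFC in question is RFC 9110 and that "browsers' behaviour" means the rules in the Fetch standard. The function name `methodAfterRedirect` and the `RedirectDecision` type are hypothetical and are not taken from `@crawlee/impit-client`.

```typescript
// Hypothetical helper: illustrates redirect method rewriting, not the actual
// @crawlee/impit-client code.
type RedirectDecision = {
  method: string;    // method to use for the follow-up request
  dropBody: boolean; // whether the original request body should be discarded
};

function methodAfterRedirect(status: number, method: string): RedirectDecision {
  const current = method.toUpperCase();

  // 301 and 302: browsers historically rewrite POST to GET.
  if ((status === 301 || status === 302) && current === "POST") {
    return { method: "GET", dropBody: true };
  }

  // 303 (See Other): everything except GET and HEAD becomes GET.
  if (status === 303 && current !== "GET" && current !== "HEAD") {
    return { method: "GET", dropBody: true };
  }

  // 307, 308, and all remaining cases keep the method and body unchanged.
  return { method: current, dropBody: false };
}

// Usage: a POST answered with 302 is retried as a body-less GET,
// while the same POST answered with 307 stays a POST.
console.log(methodAfterRedirect(302, "POST"));
console.log(methodAfterRedirect(307, "POST"));
```

Keeping the decision in a pure function like this makes the status-to-method table easy to unit test separately from the networking code.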

`FileDownload` waits indefinitely on unconsumed stream

Due to the design of Crawlee request handlers, the user-supplied request handler can return before the response stream is consumed. Because of this, we are waiting until the stream is...

t-tooling

Remove `got-scraping` as `crawlee` dependency

With the `HttpClient` abstraction, we now allow users to switch HTTP client implementations in a standardized manner. E.g., the `HttpCrawler` implementation still contains references to `got-scraping`, which forces us to...

t-tooling

Improve the `Session` reuse strategy

The current `SessionPool` implementation generates a large number of `Session` instances before ever reusing one. This limits the use cases for this class and wastes resources. Related discussion (https://github.com/apify/crawlee/pull/3199/files#r2452714909). Closes...

t-tooling

Refactor cookie handling

In a fashion similar to https://github.com/apify/crawlee/issues/3198, we should extract cookie handling from all parts of Crawlee and treat the `Session` instances as the single source of truth for the current...

t-tooling

Multiple crawler instances share `useState` state

### Which package is this bug report for? If unsure which one to select, leave blank @crawlee/basic (BasicCrawler) ### Issue description When instantiating multiple crawler instances at once, their `useState`...

bug

t-tooling