crawlee
crawlee copied to clipboard
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and o...
[](https://renovatebot.com) This PR contains the following updates: | Package | Change | Age | Adoption | Passing | Confidence | |---|---|---|---|---|---| | [turbo](https://turbo.build/repo) ([source](https://togithub.com/vercel/turbo)) | [`1.13.3` -> `2.0.6`](https://renovatebot.com/diffs/npm/turbo/1.13.3/2.0.6) |...
Allows to opt-out from the new `iframe` expansion feature in `parseWithCheerio` from https://github.com/apify/crawlee/commit/328d08598807782b3712bd543e394fe9a000a85d
### Which package is this bug report for? If unsure which one to select, leave blank @crawlee/playwright (PlaywrightCrawler) ### Issue description I'm: 1. Running a node app with worker threads...
### Which package is this bug report for? If unsure which one to select, leave blank None ### Issue description Hi all, Is the following a bug? I'm noticing that...
### Which package is this bug report for? If unsure which one to select, leave blank @crawlee/browser (BrowserCrawler) ### Issue description According to the documentation the proxies and sessions are...
I updated the Open Graph Parser so that it can parse arrays of images with properties and present them as arrays of image objects with properties. There are other Open...
### Which package is this bug report for? If unsure which one to select, leave blank @crawlee/playwright (PlaywrightCrawler) ### Issue description use `enqueueLinks()` without any parameters in the request handler...
[](https://renovatebot.com) This PR contains the following updates: | Package | Change | Age | Adoption | Passing | Confidence | |---|---|---|---|---|---| | [turbo](https://turbo.build/repo) ([source](https://togithub.com/vercel/turborepo)) | [`1.13.3` -> `2.0.14`](https://renovatebot.com/diffs/npm/turbo/1.13.3/2.0.14) |...
### Which package is this bug report for? If unsure which one to select, leave blank @crawlee/cheerio (CheerioCrawler) ### Issue description The CheerioCrawler is not persisting cookies at all. The...
[](https://renovatebot.com) This PR contains the following updates: | Package | Change | Age | Adoption | Passing | Confidence | |---|---|---|---|---|---| | [puppeteer](https://togithub.com/puppeteer/puppeteer/tree/main#readme) ([source](https://togithub.com/puppeteer/puppeteer)) | [`22.12.0` -> `23.2.0`](https://renovatebot.com/diffs/npm/puppeteer/22.12.0/23.2.0) |...