scrapy-playwright
scrapy-playwright copied to clipboard
🎭 Playwright integration for Scrapy
### Environment - python 3.10 - OS: macOS 14.1.1, Ubuntu 22.04 LTS - playwright Version 1.42.0 When PLAYWRIGHT_BROWSER_TYPE set as 'chromium' (or default) under macOS, , there appears to be...
## Configurations Configurations 1: Model Name: MacBook Pro Chip: Apple M1 Max Cores: 10 (8 performance and 2 efficiency) Memory: 32GB System Version: macOS Sonoma 14.2.1 Playwright Version: 1.42.0 Python...
I seem to be getting the following issue but I am unsure why the argument passed is invalid? Model Name: MacBook Pro Model Identifier: Mac14,7 Model Number: MNEJ3LL/A Chip: Apple...
After clicking two radio buttons, the page should post-back and display a form. Unfortunately this is not happening. In regular playwright it works. I can't understand why. This is the...
When crawling websites, it can only proceed with the next request after the previous one has finished, acting as if the requests are blocking.
## Description browserless have default browser session timeout as 30s by default https://docs.browserless.io/Docker/docker#connection-timeout, this can be turned off by setting `-1` in the browserless container env variable. However, when it...
Hi! Apologies if I am not writing this correctly. But I just wanted to raise this issue. I am currently running a scrapy spider on my Windows machine, particularly in...
Currently, **scrapy-playwright** only supports Chromium for connecting to remote browser instances over CDP (Chrome DevTools Protocol). Firefox is quite effective in bypassing detections against some anti-bot measures. Is there any...