botasaurus
The All in One Framework to build Awesome Scrapers.
Will this project support request/response interception, similar to pyppeteer's intercept function? If I need it now, how can I implement it myself?
```python
2024-09-08T13:45:25.789162996Z Traceback (most recent call last):
2024-09-08T13:45:25.789376791Z   File "/usr/local/lib/python3.10/site-packages/botasaurus/browser_decorator.py", line 179, in run_task
2024-09-08T13:45:25.789382011Z     result = func(driver, data)
2024-09-08T13:45:25.789384205Z   File "/app/main.py", line 28, in scrape
2024-09-08T13:45:25.789386199Z     driver.google_get(data, bypass_cloudflare=True)
2024-09-08T13:45:25.789388072Z...
```
https://docs.browserless.io/ It would be great if we could connect to browserless instances for scrapes.
Example from Selenium: to get the number of tabs, use `driver.window_handles`; to switch to a tab, use `driver.switch_to.window(tab hash)`. Could you describe how to do this in botasaurus?
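For reference, the Selenium side of this question can be sketched roughly as follows. The helper names `count_tabs` and `switch_to_tab` are hypothetical and only wrap the two Selenium WebDriver members mentioned above (`driver.window_handles` and `driver.switch_to.window(...)`). This is not a botasaurus API; whether botasaurus exposes an equivalent is exactly what the question asks.

```python
def count_tabs(driver):
    """Return the number of open tabs/windows for a Selenium-style driver."""
    # window_handles is a list of opaque handle strings, one per tab/window.
    return len(driver.window_handles)


def switch_to_tab(driver, index):
    """Switch the driver to the tab at position `index` and return its handle.

    Hypothetical helper: `driver` is assumed to expose `window_handles`
    and `switch_to.window(handle)`, as a Selenium WebDriver does.
    """
    handles = driver.window_handles
    if not 0 <= index < len(handles):
        raise IndexError(
            f"tab index {index} out of range for {len(handles)} open tab(s)"
        )
    handle = handles[index]
    driver.switch_to.window(handle)
    return handle
```

With a real Selenium driver, `switch_to_tab(driver, count_tabs(driver) - 1)` would move focus to the last handle in the list.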
When opening the site with driver.google_get("http://www.etsy.com") and user_agent=UserAgent.RANDOM, a captcha appears on the site and errors are reported. If you set UserAgent.REAL instead, the site opens without a captcha or errors.

When I initialize Driver, I get this error: `Running python-BaseException Traceback (most recent call last): File "D:\Programs\PyCharm Community Edition 2023.3.2\plugins\python-ce\helpers\pydev\pydevd.py", line 1570, in _exec pydev_imports.execfile(file, globals, locals) # execute the script...
See config.py in botasaurus_driver:
```
if self.headless:
    args.append("--headless=new")
else:
    if is_vmish or self.enable_xvfb_virtual_display:  # Modified condition
        from pyvirtualdisplay import Display
        try:
            self._display = Display(visible=False, size=(1920, 1080))
            self._display.start()
        except FileNotFoundError:
            print(...
```
I am trying to bypass Cloudflare. My IP is probably flagged (VPN/proxies), but manual clicks bypass the detection. I tried running it in a Docker container and natively on macOS....
https://github.com/omkarcloud/botasaurus-driver/pull/19