botasaurus icon indicating copy to clipboard operation
botasaurus copied to clipboard

The All in One Framework to build Awesome Scrapers.

Results 115 botasaurus issues
Sort by recently updated
recently updated
newest added

Will this project support the intercept function similar to pyppeteer's intercept function? If I need it now, how can I do it myself? intercept request or response

```python 2024-09-08T13:45:25.789162996Z Traceback (most recent call last): 2024-09-08T13:45:25.789376791Z File "/usr/local/lib/python3.10/site-packages/botasaurus/browser_decorator.py", line 179, in run_task 2024-09-08T13:45:25.789382011Z result = func(driver, data) 2024-09-08T13:45:25.789384205Z File "/app/main.py", line 28, in scrape 2024-09-08T13:45:25.789386199Z driver.google_get(data, bypass_cloudflare=True) 2024-09-08T13:45:25.789388072Z...

https://docs.browserless.io/ It would be great if we could connect to browserless instances for scrapes.

Example from Selenium get num tabs - driver.window_handles switch to tab - driver.switch_to.window(tab hash) Please write how to do it in botasaurus?

When going to the site driver.google_get("http://www.etsy.com") and user_agent=UserAgent.RANDOM a captcha appears on the site and writes errors. If you install UserAgent.REAL, the site opens without a captcha and errors ![Screenshot_1](https://github.com/user-attachments/assets/0996046b-a34e-43d6-8469-5c7f4401e6cf)

![image](https://github.com/user-attachments/assets/f3d44b7d-ed1e-4861-8772-bb1ce0adc5f7)

When I init Driver I got error `Running python-BaseException Traceback (most recent call last): File "D:\Programs\PyCharm Community Edition 2023.3.2\plugins\python-ce\helpers\pydev\pydevd.py", line 1570, in _exec pydev_imports.execfile(file, globals, locals) # execute the script...

see config.py in botasaurus_driver: ``` if self.headless: args.append("--headless=new") else: if is_vmish or self.enable_xvfb_virtual_display: # Modified condition from pyvirtualdisplay import Display try: self._display = Display(visible=False, size=(1920, 1080)) self._display.start() except FileNotFoundError: print(...

I am trying to bypass cloudflare. my ip is probably flagged(vpn/proxies), but manual clicks bypass the the detection. I tried running that in a docker container and native on macOS....

https://github.com/omkarcloud/botasaurus-driver/pull/19