botasaurus icon indicating copy to clipboard operation
botasaurus copied to clipboard

The All in One Framework to build Awesome Scrapers.

Results 115 botasaurus issues
Sort by recently updated
recently updated
newest added

Unable to bypass invisible turnstile on linux, but on windows it works Any clues ?

When I try to run this code: # ///////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////// from botasaurus.browser import browser, Driver, cdp @browser() def scrape_responses_task(driver: Driver, data): def after_response_handler( request_id: str, response: cdp.network.Response, event: cdp.network.ResponseReceived, ): url...

I have been trying to scrape Upwork and I am unable to bypass the captcha ``` from botasaurus.browser import browser, Driver from selenium.webdriver.common.by import By import re @browser( wait_for_complete_page_load=True, headless=False...

Hello! Thank you for this project. The issue #205 raised the question of using a remote browser, but I was unable to get it to work. When using ```python from...

There currently is no way if doing key presses, for example I want to use tab to move between inputs for more realistic behaviour and input focussing

@browser(run_async=True) def google_search_and_click_sequential(driver: Driver, data=None): query = "httpbin ip httpbin org" driver.enable_human_mode() driver.get(data) el = driver._tab.wait_for('textarea[name="q"]', timeout=40) el.send_keys(query, press_enter=True) mouseClick(driver) time.sleep(2) driver.wait_for_element("a h3", wait=20) first_result = driver.select_all("a h3")[0] first_result.click() driver.short_random_sleep()...

Scrapes URLs in parallel (3 at a time) using browser tabs and rotating proxies. Searches for specific "success keywords" like "purchase", or "sale" on the page content. If any one...

I wasn't able to test this fully. I seem to be having issues passing max_results to cmd line bota? The program ran without issues however did not respect the max_results...

I wrote my own implementation of a Google scraper based on the code provided in this bota readme. Is anyone experiencing issues around scrolling on Google? Did Google change their...

I noticed that the Sitemap function doesn't take into account urls where the sitemap lies in at /sitemap_index.html. A page like denydesigns.com works because it's like sitemap is at /sitemap.xml...