botasaurus
The All in One Framework to build Awesome Scrapers.
Unable to bypass invisible Turnstile on Linux, but it works on Windows. Any clues?
When I try to run this code: from botasaurus.browser import browser, Driver, cdp @browser() def scrape_responses_task(driver: Driver, data): def after_response_handler( request_id: str, response: cdp.network.Response, event: cdp.network.ResponseReceived, ): url...
I have been trying to scrape Upwork and I am unable to bypass the captcha ``` from botasaurus.browser import browser, Driver from selenium.webdriver.common.by import By import re @browser( wait_for_complete_page_load=True, headless=False...
Hello! Thank you for this project. Issue #205 raised the question of using a remote browser, but I was unable to get it to work. When using ```python from...
There is currently no way of doing key presses. For example, I want to use Tab to move between inputs for more realistic behaviour and input focusing.
@browser(run_async=True) def google_search_and_click_sequential(driver: Driver, data=None): query = "httpbin ip httpbin org" driver.enable_human_mode() driver.get(data) el = driver._tab.wait_for('textarea[name="q"]', timeout=40) el.send_keys(query, press_enter=True) mouseClick(driver) time.sleep(2) driver.wait_for_element("a h3", wait=20) first_result = driver.select_all("a h3")[0] first_result.click() driver.short_random_sleep()...
Scrapes URLs in parallel (3 at a time) using browser tabs and rotating proxies. Searches the page content for specific "success keywords" like "purchase" or "sale". If any one...
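The keyword check described above could be sketched roughly as follows. This is only an illustration in plain Python, not the code from the issue and not botasaurus API: `find_success_keywords` and `scan_pages` are hypothetical names, the pages are assumed to be already fetched as text, and a thread pool stands in for the browser tabs and proxies.

```python
from concurrent.futures import ThreadPoolExecutor

# The two example keywords quoted in the description above.
SUCCESS_KEYWORDS = ("purchase", "sale")


def find_success_keywords(page_text, keywords=SUCCESS_KEYWORDS):
    """Return the keywords that occur in the page text (case-insensitive substring match)."""
    lowered = page_text.lower()
    return [kw for kw in keywords if kw.lower() in lowered]


def scan_pages(pages, max_workers=3):
    """Check several pages concurrently, at most `max_workers` at a time.

    `pages` maps a URL to its already-fetched text (an assumption made so the
    sketch stays self-contained; fetching is out of scope here).
    """
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map preserves input order, so results line up with the URLs.
        results = list(pool.map(find_success_keywords, pages.values()))
    return dict(zip(pages.keys(), results))
```

Because this is a substring match, "sale" would also match a word such as "wholesale"; a word-boundary regex may be a better fit if that matters.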
I wasn't able to test this fully. I seem to be having issues passing max_results to cmd line bota? The program ran without issues; however, it did not respect the max_results...
I wrote my own implementation of a Google scraper based on the code provided in this bota readme. Is anyone experiencing issues around scrolling on Google? Did Google change their...
I noticed that the Sitemap function doesn't take into account URLs where the sitemap is located at /sitemap_index.html. A site like denydesigns.com works because its sitemap is at /sitemap.xml...
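One way to picture the behaviour being asked for is to try more than one sitemap location per site. The sketch below is a minimal illustration using only the standard library; `candidate_sitemap_urls` and `CANDIDATE_SITEMAP_PATHS` are hypothetical names, not part of botasaurus, and the `https://` default is an assumption.

```python
from urllib.parse import urljoin

# The two locations mentioned in the report above; a real list might be longer.
CANDIDATE_SITEMAP_PATHS = ("/sitemap.xml", "/sitemap_index.html")


def candidate_sitemap_urls(site):
    """Build the sitemap URLs worth trying for a site, in order of preference."""
    # Assumption: a bare domain such as "denydesigns.com" is served over HTTPS.
    base = site if "://" in site else f"https://{site}"
    return [urljoin(base, path) for path in CANDIDATE_SITEMAP_PATHS]
```

A caller could request each candidate in turn and stop at the first one that returns a valid sitemap.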