botasaurus
botasaurus copied to clipboard
The All in One Framework to build Awesome Scrapers.
If I want to scrape a html element with some class how do I save those element data?
I think I need to add here: ``` def create_options_and_driver_attributes_and_close_proxy(.....,headless): ...... elif is_docker(): options.add_argument('--no-sandbox') if headless: options.add_argument('--headless=new') # options.add_argument('--disable-setuid-sandbox') ``` I needed it, I had to fork it. or add...
*URL: https://www.g2.com/categories/crm?order=g2_score&page=3#product-list *For this url It is not able to bypass cloudflare (Reason - verification is taking too long)
Unable to pass Steamdb's detection, manually clicking on the checkbox is also useless
I just updated the package to latest version. before it was 3.2.7 and it worked fine which bypassed 9/10. but now it is bypassing only 1/10. not able to bypass...
Hello, i'm currently trying to integrate this in AWS lambda and the problem i found is lambda cannot write into any directory other than `/tmp/` so i want to change...
Included detail to obtain HTML source code in string format.
Hi, I created with my friends an open source library for making scraping with the use of the ai (https://github.com/VinciGit00/Scrapegraph-ai), if u want I can implement you the functions for...
By integrating custom scraping task `_request_scrape_data` into a class in this way ```python class RequestCrawlasaurus(): def __init__(self, start_url=""): self.start_url= start_url @request( output=None, create_error_logs=False, raise_exception=True, parallel=3, ) def _request_scrape_data(self, request: AntiDetectDriver,...
Error in create_driver_utils.py, line 226, in create_selenium_driver: `NameError: name 'SessionNotCreatedException' is not defined` Fix was to just import the correct exception type.