
Illustration dataset could not be downloaded / Crawler raises an exception

Open gsoykan opened this issue 3 years ago • 2 comments

I tried to download the illustration dataset by following the instructions and running the crawler. However, it does not download any files at all, possibly because of a change on openlibrary.org. The script also does not run to completion; it raises the exception below. I am also attaching the corresponding browser state (screenshot).

Traceback (most recent call last):
  File "openlibraryImageDownloaderMain.py", line 48, in <module>
    main(opts)
  File "openlibraryImageDownloaderMain.py", line 26, in main
    olh.search_author(illustrator, dir_name, lower_case_list)
  File "~/ganilla/datasets/scraper_openlibrary.py", line 52, in search_author
    search_res = self.browser.find_element_by_id("searchResults")
  File "~/selenium/webdriver/remote/webdriver.py", line 360, in find_element_by_id
    return self.find_element(by=By.ID, value=id_)
  File "~/selenium/webdriver/remote/webdriver.py", line 978, in find_element
    'value': value})['value']
  File "~/selenium/webdriver/remote/webdriver.py", line 321, in execute
    self.error_handler.check_response(response)
  File "~/selenium/webdriver/remote/errorhandler.py", line 242, in check_response
    raise exception_class(message, screen, stacktrace)
selenium.common.exceptions.NoSuchElementException: Message: Unable to locate element: [id="searchResults"]

gsoykan avatar Apr 08 '21 08:04 gsoykan

Were you able to download the illustration dataset?

741736152 avatar Sep 02 '21 03:09 741736152

Hello. Yes, it does look like the website has changed its interface, given the error you got: "Message: Unable to locate element: [id="searchResults"]". I am sorry, but you will need to debug step by step and locate the elements by inspecting the website. If you manage to get it working again, please open a PR. Cheers!
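For anyone attempting the step-by-step debugging suggested above, here is a minimal sketch of one way to probe the page for a replacement element. It is not the repository's code. Only the `searchResults` id comes from the traceback; the other selectors are hypothetical placeholders to replace after inspecting the live page, and `find_first` is a made-up helper name. It relies on `find_elements(by, value)`, which returns an empty list instead of raising when nothing matches.

```python
# Locator strategy strings; these are the values of selenium's By.ID and
# By.CSS_SELECTOR, written out so the helper has no import-time dependency.
BY_ID = "id"
BY_CSS = "css selector"

# Only "searchResults" is taken from the traceback. The remaining entries
# are ASSUMPTIONS (unverified placeholders); replace them with whatever the
# browser's inspector shows on the current search results page.
CANDIDATE_LOCATORS = [
    (BY_ID, "searchResults"),
    (BY_CSS, "ul.list-books"),            # hypothetical selector
    (BY_CSS, "[class*='searchResult']"),  # hypothetical selector
]


def find_first(driver, candidates=CANDIDATE_LOCATORS):
    """Return ((by, value), element) for the first candidate that matches.

    Returns None when no candidate matches, so the caller can decide how
    to report the failure instead of crashing mid-crawl.
    """
    for by, value in candidates:
        matches = driver.find_elements(by, value)  # [] when nothing matches
        if matches:
            return (by, value), matches[0]
    return None
```

Inside `search_author`, one could call `find_first(self.browser)` in place of the failing lookup and, when it returns None, write `self.browser.page_source` to a file to see what the site actually served.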

giddyyupp avatar Oct 25 '23 18:10 giddyyupp