google-images-download

Google limit?

Open manubider opened this issue 6 years ago • 6 comments

Hey! I'm getting a limit of 675 photos when downloading with different keywords. What could this be? Does Google have a limit around that number? I want to download at least 10000 to train a neural network.

manubider avatar Jun 04 '19 05:06 manubider

I ran into the same problem.

YueLiao avatar Jun 05 '19 06:06 YueLiao

Until there's a fix, I'd suggest trying the offset keyword. You could even run a loop that calls this function many times and offsets by an additional 675 each time. I haven't tried this though, but I'd be interested to see if it works.
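
A minimal, untested sketch of such a loop in Python. The helper names `offset_batches` and `download_in_batches` are hypothetical, and `download` is a caller-supplied function standing in for whatever actually fetches the images. It assumes `offset` means "number of results to skip" and `limit` means "number of images for this call"; the library may instead treat `limit` as an absolute count, so check that before relying on it.

```python
from typing import Any, Callable, Dict, Iterator, List


def offset_batches(total: int, batch_size: int = 675) -> Iterator[Dict[str, int]]:
    """Yield offset/limit windows that together cover `total` images."""
    if total < 0 or batch_size <= 0:
        raise ValueError("total must be >= 0 and batch_size must be > 0")
    for start in range(0, total, batch_size):
        # The last window is shortened so the windows never exceed `total`.
        yield {"offset": start, "limit": min(batch_size, total - start)}


def download_in_batches(
    keywords: str,
    total: int,
    download: Callable[[Dict[str, Any]], Any],
    batch_size: int = 675,
) -> List[Any]:
    """Call `download` once per window and collect whatever it returns."""
    results = []
    for window in offset_batches(total, batch_size):
        # Assumption: the downloader accepts a dict of arguments that
        # includes "keywords", "offset" and "limit".
        results.append(download({"keywords": keywords, **window}))
    return results
```

For 1500 images this yields three windows starting at offsets 0, 675 and 1350. Whether Google actually serves results past the first window is a separate question that this sketch cannot answer.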

ablacklama avatar Jun 23 '19 23:06 ablacklama

@ablacklama Unfortunately, not all 5000 could be downloaded because some images were not downloadable. 814 is all I got for this search filter! It couldn't give me a single additional photo.

AmineHosni avatar Aug 01 '19 13:08 AmineHosni

I was wondering why I only got 397 photos with a limit of 500 for the keyword "dog husky", so I checked how the Google image search page behaves. It actually has a button that needs to be clicked once the page reaches 397 images. Maybe this behavior is not handled by the latest script, so the crawler stops earlier than the real image limit.

Also, since Google Images itself is not an unlimited repository of images, be mindful that you could reach the last image for a keyword before reaching your defined limit.

refandhika avatar Sep 23 '19 13:09 refandhika

Can you solve the problem?

ozcelikkale avatar Dec 05 '19 08:12 ozcelikkale

I believe that the search simply reaches the end of the allowable scrolling range for a Google/Bing image search. I found this typically occurs after around 500-700 images. Since the Selenium functionality of the scraper simply mimics a human using Chrome to conduct an image search manually, I don't believe a workaround is possible, other than possibly searching with slightly different search terms and then removing duplicates in post-processing.
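
For the duplicate-removal step, a minimal sketch in Python, assuming the images from all searches were saved somewhere under one root folder. The helper names `file_digest` and `find_duplicates` are hypothetical. It only catches byte-identical files; the same picture saved at a different size or compression would need a perceptual hash instead.

```python
import hashlib
from pathlib import Path
from typing import Dict, List


def file_digest(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the SHA-256 hex digest of a file, read in chunks."""
    hasher = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest()


def find_duplicates(root: Path) -> List[Path]:
    """Return files under `root` whose bytes match an earlier file."""
    seen: Dict[str, Path] = {}
    duplicates: List[Path] = []
    # Sorting makes the choice of which copy is kept deterministic.
    for path in sorted(p for p in root.rglob("*") if p.is_file()):
        digest = file_digest(path)
        if digest in seen:
            duplicates.append(path)
        else:
            seen[digest] = path
    return duplicates
```

The function only reports duplicates and deletes nothing, so the list can be reviewed before calling `path.unlink()` on each entry.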

Screen Shot 2020-02-25 at 10 04 06 PM

glenn-jocher avatar Feb 26 '20 06:02 glenn-jocher