icrawler issues

catch DecompressionBomb(warning/exception)

some website return abnormal image, which may cause the death of download thread

parser error(?) in flickr crawler

1

When I command to crawl 1000 images, I got message around 500th image. That means there's no more images? But when I search on flickr site, there are more hundreds...

kyung-wook

bug

The interval for icrawler

1

First, I want to say thank you. The icrawler does have me a lot. I have read the source code, but didn't find a way to specify the interval for...

h0nestliar

feature

Result searching by website different from results from icrawler in BaiduImage

when I use the keyword: '热水器+浴室' to search on the website of Baidu, I got the correct result: ![微信截图_20190311132516](https://user-images.githubusercontent.com/13494034/54103799-b8101680-4408-11e9-98ee-bc9c536343ab.png) however, when I use the same keyword in icrawler, I got...

jamiechoi1995

bug

needs reproduce

Accessing task_queue

I am trying to access task_queue to access the task dictionary. But i am unable to, so can someone please suggest how to go about it. Thanks

syed-asnal01

question

How to change root_dir in storage argument?

3

I was try to change the `root_dir` by the following: ` google_crawler = GoogleImageCrawler( feeder_threads=1, parser_threads=1, downloader_threads=4, storage=storage) ` ` google_crawler.set_storage(new_storage) ` But it doesn't seem to work. Did I...

tienthegainz

bug

needs reproduce

not able to download images of multiple class?

1

how to download images of multiple classs?

abhinavsp0730

question

Multiple Color, Size and License ?

How to use multiple Color, Size and License ?

FerdinaKusumah

question

Image files by Bing Crawler were collapsed

1

As Google Crawler doesn't work properly at this momont, I tried to use Bing Crawler. But all downloaded image files were collapsed. It worked well in the last month. Will...

masa126

bug

needs reproduce

Scrape metadata with the built-in Flickr crawler

class MyImageDownloader(ImageDownloader): def __init__(self, thread_num, signal, session, storage, log_file): super(MyDownloader, self).__init__(thread_num, signal, session, storage) self.log_file = open(log_file, 'w') def process_meta(self, task): if task['success']: with self.lock: self.log_file.write('{} {} {} {}\n'.format( task['filename'],...

spencerchubb

question

icrawler
icrawler copied to clipboard

Metadata

catch DecompressionBomb(warning/exception)

parser error(?) in flickr crawler

The interval for icrawler

Result searching by website different from results from icrawler in BaiduImage

Accessing task_queue

How to change root_dir in storage argument?

not able to download images of multiple class?

Multiple Color, Size and License ?

Image files by Bing Crawler were collapsed

Scrape metadata with the built-in Flickr crawler

← Metadata

Owner

Metadata

icrawler icrawler copied to clipboard

Metadata

← Metadata

Owner

Metadata

icrawler
icrawler copied to clipboard