tencent-ml-images icon indicating copy to clipboard operation
tencent-ml-images copied to clipboard

So many image links are invalid

Open gongwk opened this issue 6 years ago • 5 comments

I run the mutithreading shell in windows with virtual linux env, but the terminal output shows that so many image urls are invalid. 1218 images in total, but i just download 228. image In addtion,in 228 downloaded images, some images are still invalid. image how can I solve this problem and get enough images to do some research.

gongwk avatar Oct 22 '18 05:10 gongwk

@gongwk The reason is that the URLs from ImageNet are too old, thus many are invalid. As suggested in our README, you can download the whole database of ImageNet (http://image-net.org/download), then collect the URLs used in our database, according to 'train_urls_and_index_from_imagenet.txt'. For other URLs from Open Images, you can download them directly.

wubaoyuan avatar Oct 22 '18 11:10 wubaoyuan

@wubaoyuan I‘ll try, thanks.

gongwk avatar Oct 23 '18 01:10 gongwk

@gongwk Please see the latest README, there is a more clear demonstration about the invalid URLs

wubaoyuan avatar Oct 23 '18 02:10 wubaoyuan

@wubaoyuan I‘ll try, thanks.

Do you download all image successfully? Dose all invalid image url can be find in Imagenet?

Zeitzmz avatar Oct 25 '18 03:10 Zeitzmz

@Zeitzmz YES, please see "Download the original images of the whole database of ImageNet from (http://image-net.org/download), and the corresponding URL file is List of all image URLs of Fall 2011 Release (see http://image-net.org/download-imageurls)"

wubaoyuan avatar Oct 25 '18 04:10 wubaoyuan