IMDb-Face icon indicating copy to clipboard operation
IMDb-Face copied to clipboard

Downloading is too slow.

Open LCorleone opened this issue 7 years ago • 12 comments

Great job! I use python urllib. Maybe I am in China, the url for downloading is too slow. Is there any way to deal with it? or is there anyone to share the dataset? Thanks.

LCorleone avatar Sep 20 '18 13:09 LCorleone

I think you can use proxy servers to accelerate your access to these images on Amazon.

hwfan avatar Sep 25 '18 09:09 hwfan

@LCorleone You can use multi-threading or proxy servers in python to speed up downloading images. The code is available.

braveapple avatar Sep 30 '18 08:09 braveapple

@BraveApple Thanks,nice work!

LCorleone avatar Oct 06 '18 07:10 LCorleone

i dont know why i must use proxy servers to download picture, it is maddening when requests post break

smartwell avatar Oct 24 '18 03:10 smartwell

i dont know why i must use proxy servers to download picture, it is maddening when requests post break

i give up

smartwell avatar Oct 24 '18 03:10 smartwell

Actually, the best way to download such datasets is to use cloud server. I used to use AWS to do this. However, there is still a problem waiting for us. It's very slow to upload datasets to our computer in China. Even using bypy to do this, it still sucks!

wangx404 avatar Nov 08 '18 06:11 wangx404

I wrote a script yesterday to download the dataset on AWS. After 12 hours, 600k images have been downloaded. (About 20% of the image links no long exist.) Even I have croped the face from the raw image, the dataset is still very huge. I think it would have a size of 55G when the whole dataset was downloaded.

wangx404 avatar Nov 09 '18 01:11 wangx404

Finally, I finished. It's about 50G, with about 17% links expired.

wangx404 avatar Nov 11 '18 05:11 wangx404

@wangx404 could you share the cropped data?

superzrx avatar Mar 06 '19 07:03 superzrx

Could someone kindly upload the downloaded data to BaiduYun ?

danielkaifeng avatar Apr 02 '19 08:04 danielkaifeng

why IMDb-Face.csv only has 1048576 images?have u downloaded all dataset?

Dantju avatar Apr 08 '19 06:04 Dantju

@wangx404 Could you share your download data to BaiduYun? many thx.

xianyujie avatar Jun 11 '19 03:06 xianyujie