pggan-pytorch

Is there some way to download the images for CelebA-HQ?

Open pjreddie opened this issue 7 years ago • 15 comments

Or if you can make them available to me I can host them somewhere for people to download. I'd really like the dataset but don't particularly want to go through the process to generate it myself.

pjreddie avatar Feb 21 '18 01:02 pjreddie

The authors released the dataset, and here is the link https://drive.google.com/open?id=0B4qLcYyJmiz0TXY1NG02bzZVRGs

Please refer to this repo: https://github.com/tkarras/progressive_growing_of_gans

nashory avatar Feb 23 '18 00:02 nashory

@nashory it seems that is only the pretrained model, not the source images.

petergerten avatar Mar 14 '18 09:03 petergerten

@petergerten aren't they stored in the .dat files in the datasets directory? https://drive.google.com/drive/folders/1iGK3jDe_1-V0Lyy5468WNpb7y74u-iEp

darthdeus avatar Mar 27 '18 12:03 darthdeus

The .dat files from Google Drive only contain the difference between the CelebA and the CelebA-HQ images. The process to create the images is actually quite complicated. I just made my scripts available at https://github.com/nperraud/download-celebA-HQ (they are far from perfect, but I hope they help).

nperraud avatar Apr 25 '18 07:04 nperraud

Hi @nperraud! Thanks for sharing the scripts. I am trying to download the deltas from Google Drive, but I get a "download quota exceeded for this file" error for every file. I guess Google Drive limits the number of downloads per day. Do you know whether the files are stored anywhere else?

BKHMSI avatar May 24 '19 20:05 BKHMSI

I generated the files and stored them as convenient zip files. You can download them from Google Drive or generate them using a pre-built Docker image. Visit suvojit-0x55aa/celebA-HQ-dataset-download

suvojit-0x55aa avatar Aug 04 '19 05:08 suvojit-0x55aa

@suvojit-0x55aa there should be 200k images in CelebA-HQ, but you are sharing only 30k of them.

shartoo avatar Aug 05 '19 07:08 shartoo

@shartoo As quoted in the Progressive Growing of GANs paper

In this section we describe the process we used to create the high-quality version of the CELEBA dataset, consisting of 30000 images in 1024 × 1024 resolution.

The Celeba HQ dataset contains only 30k images.

suvojit-0x55aa avatar Aug 05 '19 07:08 suvojit-0x55aa

Sorry for my careless check, you're right.

shartoo avatar Aug 05 '19 07:08 shartoo

Another question: how can I use list_attr_celeba.txt, i.e. the attribute labels, for CelebA-HQ? Are the 30k CelebA-HQ images the first 30k of the 200k images in the CelebA dataset?

shartoo avatar Aug 05 '19 08:08 shartoo

Got it: the image list file at https://raw.githubusercontent.com/nperraud/download-celebA-HQ/master/image_list.txt relates the CelebA and CelebA-HQ files.
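For anyone wanting to do the same join, here is a minimal sketch in Python. It assumes that image_list.txt is whitespace-separated with a header row containing the columns `idx` (CelebA-HQ index) and `orig_file` (original CelebA file name), and that the attribute file has the image count on line 1, the attribute names on line 2, and then one `<file name> <values...>` row per image. The function names and the two tiny sample strings are made up for illustration; they are not real rows from either file, so check the layout of your own copies first.

```python
from typing import Dict, List, Tuple


def parse_image_list(text: str) -> Dict[int, str]:
    """Map CelebA-HQ index -> original CelebA file name.

    Assumes a header row that contains the columns `idx` and `orig_file`.
    """
    rows = [ln.split() for ln in text.strip().splitlines() if ln.strip()]
    header, body = rows[0], rows[1:]
    idx_col = header.index("idx")
    file_col = header.index("orig_file")
    return {int(r[idx_col]): r[file_col] for r in body}


def parse_attributes(text: str) -> Tuple[List[str], Dict[str, List[int]]]:
    """Parse a CelebA-style attribute file.

    Assumes line 1 is the image count, line 2 lists the attribute names,
    and each later line is `<file name> <value> <value> ...` (-1 or 1).
    """
    lines = [ln for ln in text.strip().splitlines() if ln.strip()]
    names = lines[1].split()
    labels: Dict[str, List[int]] = {}
    for ln in lines[2:]:
        parts = ln.split()
        labels[parts[0]] = [int(v) for v in parts[1:]]
    return names, labels


def hq_labels(
    image_list_text: str, attr_text: str
) -> Tuple[List[str], Dict[int, List[int]]]:
    """Return attribute names and a CelebA-HQ index -> attribute values map."""
    mapping = parse_image_list(image_list_text)
    names, labels = parse_attributes(attr_text)
    # Skip HQ entries whose original file has no attribute row.
    joined = {hq: labels[f] for hq, f in mapping.items() if f in labels}
    return names, joined


# Made-up sample data in the assumed layouts (NOT real rows).
SAMPLE_IMAGE_LIST = """idx orig_idx orig_file proc_md5 final_md5
0 119613 119614.jpg aaa bbb
1 99094 099095.jpg ccc ddd
"""

SAMPLE_ATTRS = """3
Bald Smiling
099095.jpg -1 1
119614.jpg 1 -1
000001.jpg -1 -1
"""

if __name__ == "__main__":
    names, joined = hq_labels(SAMPLE_IMAGE_LIST, SAMPLE_ATTRS)
    for hq_idx in sorted(joined):
        print(hq_idx, dict(zip(names, joined[hq_idx])))
```

With real data you would read both files from disk and pass their contents to `hq_labels`. Looking rows up by file name rather than by position avoids assuming that the CelebA-HQ images are the first 30k of CelebA.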

shartoo avatar Aug 05 '19 08:08 shartoo

Is it possible to share labels with the dataset? Thanks!

Cuilie avatar Nov 29 '19 03:11 Cuilie

@Cuilie you can use this file.

suvojit-0x55aa avatar Nov 29 '19 10:11 suvojit-0x55aa

@suvojit-0x55aa On your GitHub page you said "The size of the final dataset is 89G". But on Google Drive you have far less than 89 GB. How come?

aciobanusebi avatar May 14 '20 09:05 aciobanusebi

@aciobanusebi as far as I remember, that's the storage needed for the raw dataset, the deltas, and the generated dataset combined.

suvojit-0x55aa avatar May 14 '20 10:05 suvojit-0x55aa