DCTNet

Dataset Version Used

Open odb2 opened this issue 4 years ago • 3 comments

I have a question about the dataset used in this project. From several sites, including Kaggle, I read that the ImageNet ILSVRC2012 dataset is around ~150 GB and has 1000 classes. But I also see on the ImageNet site that it contains ~21,000 classes. So was the training in this paper done on the full ImageNet dataset (~21,000 classes) or on the smaller 1000-class dataset?

Also, do you think running your resnet_upscaled_static.sh script will give similar results if I feed it the dataset that contains 1000 classes?

Also awesome paper btw!

odb2 avatar May 01 '20 11:05 odb2

Thanks very much for your question. We use ILSVRC2012, which is ~150 GB and has 1000 classes.

kaix90 avatar May 12 '20 02:05 kaix90

Thanks! One additional instruction that might help others: when you download ImageNet, the validation files do not come organized into folders like the training set. I had to run this script, https://raw.githubusercontent.com/soumith/imagenetloader.torch/master/valprep.sh, which I found through https://docs.nvidia.com/deeplearning/dali/archives/dali_08_beta/dali-developer-guide/docs/examples/pytorch/resnet50/pytorch-resnet50.html, to get them organized into folders.
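
For anyone who would rather do this step from Python, here is a minimal sketch of the same idea: move each flat validation image into a subfolder named after its class ID. It is not the linked script. The function name `organize_val` and the `file_to_wnid` dictionary (filename to class folder name) are hypothetical, and you would have to build that mapping yourself from the validation labels that come with the dataset.

```python
import shutil
from pathlib import Path


def organize_val(val_dir, file_to_wnid):
    """Move flat validation images into one subfolder per class.

    val_dir:      folder that currently holds all validation images flat.
    file_to_wnid: hypothetical mapping {image filename: class folder name},
                  assumed to be built from the dataset's validation labels.
    Returns the number of files that were moved.
    """
    val_dir = Path(val_dir)
    moved = 0
    for filename, wnid in file_to_wnid.items():
        src = val_dir / filename
        if not src.is_file():
            # Already moved on an earlier run, or not present: skip it.
            continue
        class_dir = val_dir / wnid
        class_dir.mkdir(exist_ok=True)  # create the class folder on first use
        shutil.move(str(src), str(class_dir / filename))
        moved += 1
    return moved
```

Running it a second time with the same mapping moves nothing, since files that are no longer in the top-level folder are skipped.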

odb2 avatar May 15 '20 07:05 odb2

@odbarragan Thank you. This is super important: the _find_classes method searches for class directories, and without that directory arrangement the code fails to work.
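
To check a folder before training, something like the sketch below may help. It is only a standard-library approximation of folder-based class discovery, assuming _find_classes works by listing the subdirectories of the dataset root; it is not the project's or the library's actual code, and `find_class_dirs` is a hypothetical name.

```python
import os


def find_class_dirs(root):
    """List class subfolders of root and assign each a sorted index.

    Raises FileNotFoundError when root has no subfolders, which is the
    situation with a freshly downloaded, flat validation folder.
    """
    classes = sorted(entry.name for entry in os.scandir(root) if entry.is_dir())
    if not classes:
        raise FileNotFoundError(f"No class folders found in {root}")
    # Index classes in sorted order so the mapping is reproducible.
    class_to_idx = {name: i for i, name in enumerate(classes)}
    return classes, class_to_idx
```

If this raises on the validation folder but works on the training folder, the validation images most likely still need to be sorted into class folders.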

suvadeepbanerjee avatar Jun 24 '20 01:06 suvadeepbanerjee