taming-transformers

How to evaluate on ImageNet?

Open JohnDreamer opened this issue 3 years ago • 2 comments

Hi, this is great work, and thanks for releasing the code! I have a question: how should I evaluate on ImageNet? In other words, should I compute the FID scores on the whole ImageNet validation set (50K images in total), using reconstructed images for the VQGAN and sampled images for the transformer? Or should I split the dataset?

JohnDreamer avatar May 23 '22 15:05 JohnDreamer

Thanks :) Reconstruction metrics should be evaluated on all 50k examples from the validation split. For transformer evaluation (i.e. sample quality), we follow standard practice: we first generate 50k new samples and then evaluate them against the full training set.

rromb avatar Jun 02 '22 20:06 rromb
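
For readers following along, here is a minimal sketch of the Fréchet distance that underlies FID, assuming feature vectors have already been extracted for both image sets. The function name `frechet_distance` and the NumPy-only eigenvalue approach are illustrative choices, not code from the taming-transformers repository. Standard FID uses pooled Inception-v3 features, which this sketch does not compute, so it will not reproduce published numbers by itself.

```python
import numpy as np

def frechet_distance(feats_a, feats_b):
    """Frechet distance between Gaussians fitted to two feature sets.

    Both inputs are arrays of shape (num_samples, feature_dim).
    Hypothetical helper for illustration only.
    """
    mu_a, mu_b = feats_a.mean(axis=0), feats_b.mean(axis=0)
    # atleast_2d keeps the 1-dimensional feature case working
    cov_a = np.atleast_2d(np.cov(feats_a, rowvar=False))
    cov_b = np.atleast_2d(np.cov(feats_b, rowvar=False))
    # Tr(sqrt(cov_a cov_b)) equals the sum of the square roots of the
    # eigenvalues of the product, which are real and non-negative for
    # positive semi-definite covariances (up to numerical noise).
    eigvals = np.linalg.eigvals(cov_a @ cov_b)
    tr_sqrt = np.sqrt(np.clip(eigvals.real, 0.0, None)).sum()
    diff = mu_a - mu_b
    return float(diff @ diff + np.trace(cov_a) + np.trace(cov_b) - 2.0 * tr_sqrt)
```

Following the reply above, the two feature sets would be the 50k reconstructions and the 50k validation images for the VQGAN, and the 50k generated samples and the full training set for the transformer.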

Thanks for the reply! I still have two questions: (1) How do you process the ground-truth (GT) images for evaluation? (a) Only resize each image to 256x256, or (b) resize the shorter edge of the image to 256 and center crop to 256x256; which do you use? (2) How do you sample the test images? Do you sample the same number of images (50) for each class?

JohnDreamer avatar Jun 06 '22 01:06 JohnDreamer
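
The thread does not record which preprocessing or sampling scheme the authors used. To make the options in the question concrete, the sketch below computes the geometry of option (b) and builds a class-balanced label list for 50 samples per class. All function names are hypothetical, and the defaults (256 pixels, 1000 classes, 50 per class) are taken from the question rather than from the repository.

```python
def short_edge_resize_size(width, height, target=256):
    """Size after scaling so the shorter edge equals `target` (aspect ratio kept)."""
    scale = target / min(width, height)
    # max() guards against rounding the shorter edge below the target
    return max(target, round(width * scale)), max(target, round(height * scale))

def center_crop_box(width, height, target=256):
    """(left, top, right, bottom) of a centered target x target crop."""
    left = (width - target) // 2
    top = (height - target) // 2
    return left, top, left + target, top + target

def balanced_labels(num_classes=1000, per_class=50):
    """Class labels for conditional sampling, the same count for every class."""
    return [c for c in range(num_classes) for _ in range(per_class)]

# Option (b) for a 500x375 image: resize to 341x256, then crop the middle.
resized = short_edge_resize_size(500, 375)   # (341, 256)
box = center_crop_box(*resized)              # (42, 0, 298, 256)
labels = balanced_labels()                   # 50,000 labels, 50 per class
```

Option (a) would instead resize straight to 256x256, which changes the aspect ratio of non-square images. The resulting size and box can be passed to the resize and crop calls of whichever image library is in use.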