GP-UNIT icon indicating copy to clipboard operation
GP-UNIT copied to clipboard

How to calculate the FID and LPIPS

Open zjwulbx opened this issue 2 years ago • 2 comments

Dear authors: I want to ask about the evaluation FID and LPIPS. When we calculate the FID and LPIPS, what dataset do we use? and under what circumstances(such as some details as how many pictures do we use? Using training dataset or testing dataset? What are the sources of real_images and fake_images, e.g. cat2dog)? Can you tell me more details about evaluation? Looking forward to you early reply!

zjwulbx avatar Oct 21 '22 02:10 zjwulbx

We use and modify this code for evaluation https://github.com/clovaai/stargan-v2/blob/875b70a150609e8a678ed8482562e7074cdce7e5/metrics/eval.py

Fake images are generated from the testing set of the dataset. For cat2dog, there are 500 testing images (https://github.com/clovaai/stargan-v2/blob/master/README.md#animal-faces-hq-dataset-afhq).

For FID, the real images are the training images in the dataset.

williamyang1991 avatar Oct 21 '22 02:10 williamyang1991

Thank you for your answer

zjwulbx avatar Oct 21 '22 05:10 zjwulbx