insightface icon indicating copy to clipboard operation
insightface copied to clipboard

jmlr training code vs description in arxiv paper

Open park-sungjoon opened this issue 3 years ago • 3 comments

Thanks for sharing your excellent work! I am looking through your code, and there seem to be some differences with the paper shared on arxiv (or I misunderstood something). Are the data preparation, training, and configuration files the same as those used to train the pretrained-model you share? Some examples:

  1. in rec_builder.py line 79, cfg.input_size = 512. My understanding is that this results in image with size 512, not 256 as described in the paper
  2. in train.py line 245, we have iter_loss.backward(), but iter_loss = dloss['Loss'], which does not include the "bone_losses". I.e. We only have L_vert + L_land in Eq. 4 in the paper on arxiv.

Thanks in advance.

park-sungjoon avatar Sep 21 '22 05:09 park-sungjoon

Hi,

  1. The training input size is 256, please see https://github.com/deepinsight/insightface/blob/master/reconstruction/jmlr/configs/s1.py#L18
  2. you are right, we will fix the code soon.

nttstar avatar Sep 21 '22 05:09 nttstar

Thanks for the reply. Yes, I checked the configuration file (s1.py and base.py); I was referring to:

if __name__ == "__main__":
    cfg = get_config('configs/s1.py')
    cfg.task = 0
    cfg.input_size = 512

in rec_builder.py. I'll be looking forward to the edited code :)

park-sungjoon avatar Sep 21 '22 06:09 park-sungjoon

The number 512 is just the size of the image in rec file, not the training size.

nttstar avatar Sep 21 '22 06:09 nttstar