
Training collapses after 72k iterations

Open Lauenburg opened this issue 2 years ago • 2 comments

I am currently trying out the model on the maps dataset (satellite and map images from Google Maps). Training went well until iteration 72k. After that, the training collapses, or rather no usable result is returned:

Images for A2B until 72000 iterations: A2B_0072000

Images for A2B after 72000 iterations: A2B_0073000

Images for B2A until 72000 iterations: B2A_0072000

Images for B2A after 72000 iterations: B2A_0073000

Any idea what could have happened here?

Lauenburg, Oct 16 '21 18:10

Hello, I have the same problem. How did you solve it?

Q-Zhang98, Nov 16 '21 13:11

This may be a case of model collapse. The training schedule can be adjusted to suit the dataset, and checkpoints can be saved during training so that the best one can be selected by manual inspection or by FID score.

alpc91, Dec 02 '21 06:12
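
For anyone hitting the same problem, here is a minimal sketch of the checkpoint-selection idea suggested above. It is not part of NICE-GAN-pytorch. It assumes you have saved checkpoints during training and computed an FID score for each of them with a tool of your choice. The function names `select_best_checkpoint` and `first_collapse`, the `factor` threshold, and the FID values are all hypothetical and only serve as an illustration.

```python
import math
from typing import Dict, Optional


def select_best_checkpoint(fid_by_iteration: Dict[int, float]) -> int:
    """Return the iteration whose checkpoint has the lowest (best) FID."""
    # Ignore checkpoints whose FID is NaN or infinite (e.g. collapsed outputs).
    valid = {it: fid for it, fid in fid_by_iteration.items() if math.isfinite(fid)}
    if not valid:
        raise ValueError("no checkpoint has a finite FID score")
    # Ties are broken in favour of the earlier iteration.
    return min(valid, key=lambda it: (valid[it], it))


def first_collapse(fid_by_iteration: Dict[int, float],
                   factor: float = 2.0) -> Optional[int]:
    """Return the first iteration that looks collapsed, or None.

    An iteration is flagged when its FID is not finite, or when it is more
    than `factor` times the best FID seen at earlier iterations.
    The threshold is an arbitrary assumption, not a recommended value.
    """
    best = math.inf
    for it in sorted(fid_by_iteration):
        fid = fid_by_iteration[it]
        if not math.isfinite(fid) or (best < math.inf and fid > factor * best):
            return it
        best = min(best, fid)
    return None


# Made-up FID values, for illustration only (not measured on NICE-GAN).
example_scores = {70000: 61.2, 71000: 58.9, 72000: 59.4, 73000: float("nan")}
best_iteration = select_best_checkpoint(example_scores)
collapse_iteration = first_collapse(example_scores)
print(best_iteration, collapse_iteration)
```

With checkpoints saved often enough, a collapse like the one at 73k costs little: you resume from, or simply keep, the best checkpoint saved before it.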