MoVQGAN icon indicating copy to clipboard operation
MoVQGAN copied to clipboard

About more detailed configuration for model reproduction

Open minkowski0125 opened this issue 1 year ago • 1 comments

Awesome work! However, I somehow encountered the difficulty reproducing the performance of the autoencoder by training-from-scratch. Especially with the addition of GAN loss, getting collapsed reconstruction results. I used a subset of laion with aesthetic score >=5, training with batch size=256 and default loss weighting configurations. May I ask the appropriate configuration for reproducing the performance of MoVQGAN? Thank you very much.

minkowski0125 avatar Jan 04 '24 12:01 minkowski0125