MoVQGAN
Request for a more detailed configuration to reproduce the model
Awesome work! However, I have had difficulty reproducing the autoencoder's performance when training from scratch. In particular, once the GAN loss is added, the reconstructions collapse. I trained on a subset of LAION with aesthetic score >= 5, using a batch size of 256 and the default loss-weighting configuration. Could you share the appropriate configuration for reproducing MoVQGAN's performance? Thank you very much.