MoVQGAN
MoVQGAN copied to clipboard
MoVQGAN - model for the image encoding and reconstruction
Awesome work! However, I somehow encountered the difficulty reproducing the performance of the autoencoder by training-from-scratch. Especially with the addition of GAN loss, getting collapsed reconstruction results. I used a...
Hi, thanks for sharing the code. [Here](https://github.com/ai-forever/MoVQGAN/blob/main/movqgan/modules/losses/lpips.py#L17) seems that you are using a `pretrained=False` VGG network. I wonder if this is the common practice? Thanks
I modified the 67M version configs to train the network with 384\*384 imgs, the size of z changed to 48\*48\*4. However it shown "cuda out of memory", even on an...
Admiring work! Do you have a fp16 version which would be better for other works like text-to-image?
Hello, may I ask you how does the hyperparameters n_embed and embed_dim influence the performance, could you give me some insight
thanks for this repo, its great. Can we get this upstreamed so its in the pip package?
I want to training from a small size image like cifar-10, wondering how to do it.
Could you please share the basic configuration file of Mo-VQGAN?