PVDM
PVDM copied to clipboard
Question about the autoencoder design
Q1: As latent diffusion uses VAE, why did you modify the structure to autoencoder, is it because of poor VAE performance?
Q2: Why design a bottleneck structure here? https://github.com/sihyun-yu/PVDM/blob/17699659148423469c0d1ccdca5e466933b943e1/models/autoencoder/autoencoder_vit.py#L180C1-L190C34