PVDM icon indicating copy to clipboard operation
PVDM copied to clipboard

Question about the autoencoder design

Open Darius-H opened this issue 9 months ago • 0 comments

Q1: As latent diffusion uses VAE, why did you modify the structure to autoencoder, is it because of poor VAE performance?

Q2: Why design a bottleneck structure here? https://github.com/sihyun-yu/PVDM/blob/17699659148423469c0d1ccdca5e466933b943e1/models/autoencoder/autoencoder_vit.py#L180C1-L190C34

Darius-H avatar May 08 '24 19:05 Darius-H