latent-diffusion icon indicating copy to clipboard operation
latent-diffusion copied to clipboard

Question about some techniques

Open UdonDa opened this issue 4 years ago • 0 comments

Hi @pesser! Thank you for sharing the implementation of your wonderful work!

I have questions about some techniques. Would you tell me these questions?

I have used your pretrained celeba256 weight. The images were recorded using such as intermediates['x_inter'].append(img).

  1. Why do you step the time? According to this line, it seems you choose time values for each num_ddpm_timesteps // num_ddim_timesteps. Actually, I have never seen this technique.

  2. If I do not step the above values and T: 1000 -> 0, i.e. the time steps are continuous and have ranged from 1 to 1000, I cannot get clear results. This image was recorded in six separate 1000 iterations. This image was recorded using x_inter. image

This image was recorded using pred_x0. image

  1. If the time step is fixed to default values in this line and the start time is decreased such as to 800, I cannot clear results. Why? Your method cannot work well other than t=1000? (actually, if t=1000, 50 iterations because time steps are split) image

  2. If your method cannot perform the question (3), your method cannot perform this unique denoising technique as shown in Sohl-Dickstein+ ICML15? Can you possibly accomplish it? image

Best regards, Udon

UdonDa avatar Dec 22 '21 10:12 UdonDa