
Sampling algorithm differs from paper.

Open ariel415el opened this issue 3 years ago • 5 comments

Hi, I want to elaborate on #2: the sampling algorithm in your code is a bit different from what is shown in the paper.

The paper suggests this sampling step:

[screenshot: the Algorithm 2 update, x_{t-1} = 1/√α_t · (x_t − β_t/√(1−ᾱ_t) · ε_θ(x_t, t)) + σ_t·z]

while you do this:

[screenshot: the code's update, which first reconstructs x̂_0 from ε_θ, clips it, and then takes the mean of the posterior q(x_{t-1} | x_t, x̂_0)]

The clipping is done here https://github.com/hojonathanho/diffusion/blob/1e0dceb3b3495bbe19116a5e1b3596cd0706c543/diffusion_tf/diffusion_utils.py#L172

Now I checked, and indeed without the clipping the two equations are the same. Can you give any interpretation or intuition for the clipping and why it is needed? It seems to be crucial for training, yet it is not mentioned in the paper.
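For reference, here is a small numpy check (a sketch with an illustrative linear beta schedule, not the repo's actual code) that without clipping the direct Algorithm 2 mean and the two-step reconstruct-then-posterior mean coincide:

```python
import numpy as np

# Illustrative linear beta schedule (names are hypothetical, not from the repo)
T = 1000
betas = np.linspace(1e-4, 0.02, T)
alphas = 1.0 - betas
abar = np.cumprod(alphas)  # cumulative product ᾱ_t

rng = np.random.default_rng(0)
t = 500
x_t = rng.normal(size=5)
eps = rng.normal(size=5)  # stand-in for the model's noise prediction ε_θ(x_t, t)

# Direct Algorithm 2 mean: (1/√α_t) · (x_t − β_t/√(1−ᾱ_t) · ε)
mean_direct = (x_t - betas[t] / np.sqrt(1.0 - abar[t]) * eps) / np.sqrt(alphas[t])

# Two-step form used in the code, WITHOUT clipping:
# reconstruct x̂_0, then take the posterior mean of q(x_{t-1} | x_t, x̂_0)
x0 = (x_t - np.sqrt(1.0 - abar[t]) * eps) / np.sqrt(abar[t])
coef1 = np.sqrt(abar[t - 1]) * betas[t] / (1.0 - abar[t])
coef2 = np.sqrt(alphas[t]) * (1.0 - abar[t - 1]) / (1.0 - abar[t])
mean_twostep = coef1 * x0 + coef2 * x_t

assert np.allclose(mean_direct, mean_twostep)
```

Once `np.clip(x0, -1, 1)` is inserted between the two steps, the equivalence breaks exactly when the reconstruction leaves [-1, 1].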

Thanks

ariel415el avatar May 20 '21 09:05 ariel415el

Is there any update on this? In my experience this detail has been crucial in determining sample quality, yet it seems to be largely unaddressed in the diffusion-model literature. Does anyone have any insight on this?

malekinho8 avatar Jul 29 '22 18:07 malekinho8

In https://huggingface.co/blog/annotated-diffusion, the author says:

Note that the code above is a simplified version of the original implementation. We found our simplification (which is in line with Algorithm 2 in the paper) to work just as well as the original, more complex implementation, which employs clipping.
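A minimal sketch of that simplified step (illustrative numpy with hypothetical names, not the post's actual PyTorch code; this uses the σ_t² = β_t choice):

```python
import numpy as np

def p_sample_simplified(x_t, eps_pred, t, betas, rng=None):
    """One reverse step per Algorithm 2, without clipping (sketch)."""
    alphas = 1.0 - betas
    abar = np.cumprod(alphas)
    # mean = (1/√α_t) · (x_t − β_t/√(1−ᾱ_t) · ε_θ)
    mean = (x_t - betas[t] / np.sqrt(1.0 - abar[t]) * eps_pred) / np.sqrt(alphas[t])
    if t == 0:
        return mean  # no noise is added at the final step
    z = (rng or np.random.default_rng()).normal(size=x_t.shape)
    return mean + np.sqrt(betas[t]) * z  # σ_t² = β_t choice
```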

Kaffaljidhmah2 avatar Sep 24 '22 01:09 Kaffaljidhmah2

The issue is that the predictions are often out of range. So the authors are trying to impose some sort of correction to get meaningful samples. To do that they restrict the reconstructed image x_recon to [-1, +1] by clipping. Here is how they generate samples:

  1. Get the noise (error) prediction at step t
  2. Get the reconstructed image, i.e. x_recon, using the noise prediction
  3. Clip x_recon, since we know x is in the range -1 to +1
  4. Using the clipped x_recon, sample x_{t-1}

2 is done using x_recon = (x_t − √(1−ᾱ_t) · ε_θ(x_t, t)) / √ᾱ_t

4 is done using the posterior mean μ̃_t(x_t, x_recon) = (√ᾱ_{t−1} · β_t / (1−ᾱ_t)) · x_recon + (√α_t · (1−ᾱ_{t−1}) / (1−ᾱ_t)) · x_t
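The four steps above can be sketched like this (illustrative numpy, not the repo's TF code; σ_t² is taken as the posterior variance β̃_t, and all names are hypothetical):

```python
import numpy as np

def p_sample_step(x_t, eps_pred, t, betas, clip=True, rng=None):
    """One reverse step using the clipped-x0 parameterization (sketch)."""
    alphas = 1.0 - betas
    abar = np.cumprod(alphas)
    abar_prev = abar[t - 1] if t > 0 else 1.0

    # 2. Reconstruct x_recon from the noise prediction
    x0 = (x_t - np.sqrt(1.0 - abar[t]) * eps_pred) / np.sqrt(abar[t])
    # 3. Clip, since training data lives in [-1, 1]
    if clip:
        x0 = np.clip(x0, -1.0, 1.0)
    # 4. Posterior mean of q(x_{t-1} | x_t, x_recon)
    mean = (np.sqrt(abar_prev) * betas[t] / (1.0 - abar[t])) * x0 \
         + (np.sqrt(alphas[t]) * (1.0 - abar_prev) / (1.0 - abar[t])) * x_t
    # σ_t² = β̃_t, the posterior variance; no noise at the final step
    var = betas[t] * (1.0 - abar_prev) / (1.0 - abar[t])
    if t == 0:
        return mean
    z = (rng or np.random.default_rng()).normal(size=x_t.shape)
    return mean + np.sqrt(var) * z
```

When the reconstruction already lies inside [-1, 1], the clipped and unclipped steps produce identical samples; they only diverge on out-of-range predictions.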

This is a hack, and it will lead to increased density at 1 and -1.
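A quick way to see that effect: clip unit-Gaussian samples to [-1, 1] and measure how much probability mass lands exactly on the boundary (analytically 2·Φ(−1) ≈ 31.7% for a standard normal; this is a toy demo, not the model's actual x_recon distribution):

```python
import numpy as np

rng = np.random.default_rng(0)
samples = rng.normal(0.0, 1.0, size=100_000)
clipped = np.clip(samples, -1.0, 1.0)

# Fraction of samples sitting exactly at -1 or +1 after clipping
frac_at_boundary = np.mean((clipped == -1.0) | (clipped == 1.0))
# For a unit Gaussian, about 2·Φ(−1) ≈ 31.7% of the mass piles onto the boundary
```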

varun-ml avatar Oct 19 '22 12:10 varun-ml

I don't see the definition of σ_t in the paper. Where is it mentioned and defined? And why do we need to add noise in the reverse process?

ndvbd avatar Jan 24 '23 09:01 ndvbd

I don't see the definition of σ_t in the paper. Where is it mentioned and defined? And why do we need to add noise in the reverse process?

σ_t is defined in Section 3.2 of the paper: σ_t² is set to either β_t or β̃_t = (1−ᾱ_{t−1})/(1−ᾱ_t) · β_t, and the authors report both choices give similar results. The noise is added because each reverse step is a sample from the Gaussian p_θ(x_{t-1} | x_t), not just its mean.

wanghao-cst avatar Apr 17 '23 05:04 wanghao-cst