EDGE Why use x_start as the target in each timestep of diffusion training?

Why use x_start as the target in each timestep of diffusion training?

Open shaoguowen opened this issue 5 months ago • 1 comments

I have seen using noise, x_noisy or v_prediction, etc. as the training target, but each timestep uses x_start as the training target, which seems a bit strange. Can you explain it or provide relevant articles?

Feb 04 '24 02:02 shaoguowen

EDGE EDGE copied to clipboard

Why use x_start as the target in each timestep of diffusion training?

EDGE
EDGE copied to clipboard