diffusion_distiller
diffusion_distiller copied to clipboard
Exploding target in distillation
Hello @Hramchenko ! Thank you for your work on this repo.
I am trying to implement the distillation for v-parametrization and used your code. I am having an issue with this line, because sigma_s
can be very small (ex: 1e-2) which causes eps_2
to become very large and thus v_2
too, while it should remain close to the [-1,1] range.
Would you have any insights on that ? Did you derive the equations yourself ?