diffusion_models
EMA model usage
In the notebooks I see the EMA model being updated through `def update`, which changes the shadow model, but the `def ema` method is never called. Should it be called once every N epochs, or only once at the end of the whole training process? As far as I can see, `def update` does not affect the actual model, so no smoothing happens.
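To make the question concrete, here is a minimal sketch of the pattern as I understand it. It is not the notebook's code: plain floats stand in for tensors, and the class layout plus the names `register`, `shadow` and `mu` are my assumptions. Only `update` and `ema` are names taken from the notebooks.

```python
class EMA:
    """Toy EMA helper (hypothetical); parameters are floats in a dict."""

    def __init__(self, mu=0.999):
        self.mu = mu          # decay rate, assumed name
        self.shadow = {}      # smoothed copy of the parameters

    def register(self, params):
        # Start the shadow copy from the current parameter values.
        self.shadow = dict(params)

    def update(self, params):
        # Moves only the shadow copy; `params` itself is left untouched.
        for name, value in params.items():
            self.shadow[name] = (1.0 - self.mu) * value + self.mu * self.shadow[name]

    def ema(self, params):
        # Copies the smoothed values back into the actual parameters.
        for name in params:
            params[name] = self.shadow[name]


params = {"w": 0.0}
helper = EMA(mu=0.5)
helper.register(params)

for step in range(3):
    params["w"] += 1.0        # stand-in for an optimizer step
    helper.update(params)     # shadow changes, params do not

raw_w = params["w"]           # value produced by "training" alone
shadow_w = helper.shadow["w"] # smoothed value held by the helper
helper.ema(params)            # only now do params receive the smoothed value
```

In this sketch `raw_w` and `shadow_w` differ after the loop, and `params` only picks up the smoothed value once `ema` is called, which is the step I cannot find in the notebooks.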