glow-pytorch
glow-pytorch copied to clipboard
a question about the sigmoid function in the affine coupling layer
Hi, thank you for your amazing work. As you mentioned, using the sigmoid function in the affine coupling layer is benefical to the training. I was wondering why you shifted the sigmoid s = F.sigmoid(log_s + 2). Thanks a lot.
Shifting will make sigmoid values towards 1, so it is helpful for gradient flow at initializations.
Shifting will make sigmoid values towards 1, so it is helpful for gradient flow at initializations.
Thank you so much for your reply~