bysowhat

Results 4 issues of bysowhat

warning: enumeration value ‘CUDNN_STATUS_RUNTIME_FP_OVERFLOW’ not handled in switch [-Wswitch]

hello: do you use gumbelsoftmax? i wonder why don't you add random noise in gumbelsoftmax and the temperature doesn't change in whole training process? thanks

could you explain what the f is in equ 4 in your paper? tanks a lot ![image](https://user-images.githubusercontent.com/9412053/118443374-62d59b00-b71e-11eb-88c0-d570fc941b26.png)

Hi, what is normalization_scaling for? Thanks