Densen
Results
1
issues of
Densen
It seems like in forward propogation of WAB, the scaling factor beta (0.2) is applied on the output of BatchNorm layer instead of applying it on residual input features.