Densen

Results: 1 issue from Densen

It seems that in the forward propagation of WAB, the scaling factor beta (0.2) is applied to the output of the BatchNorm layer instead of to the residual input features.
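
To make the two placements concrete, here is a minimal sketch in plain Python. It is not the repository's code: `branch` is a hypothetical stand-in for whatever the block computes up to and including BatchNorm, and the function names are made up for illustration. Only the value 0.2 for beta comes from the report above.

```python
BETA = 0.2  # scaling factor quoted in the issue


def branch(x):
    """Hypothetical stand-in for the block's layers up to and including BatchNorm."""
    return [2.0 * v + 1.0 for v in x]


def forward_scale_branch(x, beta=BETA):
    """Placement the issue reports seeing: beta scales the BatchNorm output."""
    return [xi + beta * bi for xi, bi in zip(x, branch(x))]


def forward_scale_input(x, beta=BETA):
    """Placement the issue seems to expect: beta scales the residual input features."""
    return [beta * xi + bi for xi, bi in zip(x, branch(x))]
```

The two functions give different outputs for the same input, so the placement of beta changes what the block computes and is not just a refactoring detail.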