DenseNet icon indicating copy to clipboard operation
DenseNet copied to clipboard

Why is composite function BN-ReLU-Conv3x3 ?

Open RDShi opened this issue 6 years ago • 1 comments

Hello,

The composite function of other models is Conv3x3-BN-ReLU. Why is DenseNet special?

Looking forward to your answer. Thanks

RDShi avatar Jul 27 '18 16:07 RDShi

Hi. This is following the preactivation design in the second ResNet paper. https://arxiv.org/abs/1603.05027

The essential difference here is that there are different scaling parameters in the BN layer in each BN-ReLU-Conv3x3. If we use BN after ReLU, every subsequent layer will be based on the same BN scaling parameters.

liuzhuang13 avatar May 11 '19 23:05 liuzhuang13