ResMLP-pytorch
ResMLP-pytorch copied to clipboard
The number of Affine
Dear, In the paper, the authors claimed that each sublayer has a residual connection and two Affine transformations. But, in your codes, I just find one Affine transformation in a residual connection. Could you tell me why? Thanks.