FasterNet
FasterNet copied to clipboard
why not use activation functions after downsampling convolutions?
Great work! But why not use activation functions after downsampling convolutions?
@1920230345 Hi, we did not conduct an ablation study on this. We suggest empirical experiments for different FasterNet variants, as further incorporating activation functions may increase or decrease the model capacity.