ResidualAttentionNetwork-pytorch
ResidualAttentionNetwork-pytorch copied to clipboard
a pytorch code about Residual Attention Network. This code is based on two projects from
Is there anyone train the resattentionnet on ImageNet? The paper didn't provide the batchsize for ImageNet training. So I set the batchsize=256/lr=0.1 which is a common setting, but the training...
what's the version of torch, torchvision and python? can anyone explain it?
i think the num of params for cifar10 residual network is incorrect, i find that it is much bigger than the num in paper
Hi, I am confused about the term `softmax_blocks`. The term in the paper should be `soft mask blocks`? I check the ResidualBlock class which does not exist normalization layers.
It seems that the this code reproduced results can not achieve the results in the original paper ?
I have question about the the soft attention mask. I have implemented residual attention blocks for specific domain (faces). How does the attention mask focus on specific regions of the...
Hi, Is the model_92_sgd.pkl is pre_trained for cifar10? Does the imagenet has the pretrained model? Thanks
Hello, I studied your code carefully, and then I found that there are different formulas for Mixed Attention, Channel Attention and Spatial Attention in the paper. But I don't see...
Hi, Can we implement the same network for 3D data by using 3d layers of same 2d layers? What do you advice?