WideResNet-pytorch
The place to add dropout
In your code, you add dropout after relu2, so your process is like `dropout(relu2(bn2(conv1(relu1(bn1(x))))))`.
But in this code, he adds dropout after conv1, so his process is like `relu2(bn2(dropout(conv1(relu1(bn1(x))))))`.
Does it matter? What is the difference in performance between the two methods? I'm stuck: using the second method, I can't reproduce the reported performance on CIFAR-10 (I only get 93.2% accuracy).
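To make the comparison concrete, here is a minimal sketch of the two orderings in a single block. It is not the code from either repository: the class name `PreActBlock`, the `dropout_after` switch, the default `drop_rate=0.3`, the second convolution, and the simplified stride-1 shortcut are all assumptions made for illustration.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class PreActBlock(nn.Module):
    """Hypothetical pre-activation block with a switchable dropout position."""

    def __init__(self, in_planes, out_planes, drop_rate=0.3, dropout_after="relu2"):
        super().__init__()
        assert dropout_after in ("relu2", "conv1")
        self.bn1 = nn.BatchNorm2d(in_planes)
        self.conv1 = nn.Conv2d(in_planes, out_planes, kernel_size=3, padding=1, bias=False)
        self.bn2 = nn.BatchNorm2d(out_planes)
        # conv2 and the shortcut are assumed; the post only describes the path up to relu2.
        self.conv2 = nn.Conv2d(out_planes, out_planes, kernel_size=3, padding=1, bias=False)
        self.drop_rate = drop_rate
        self.dropout_after = dropout_after
        # Use a 1x1 convolution on the shortcut only when the channel counts differ.
        self.shortcut = (
            None if in_planes == out_planes
            else nn.Conv2d(in_planes, out_planes, kernel_size=1, bias=False)
        )

    def forward(self, x):
        out = self.conv1(F.relu(self.bn1(x)))
        if self.dropout_after == "conv1":
            # Second method: relu2(bn2(dropout(conv1(relu1(bn1(x))))))
            out = F.dropout(out, p=self.drop_rate, training=self.training)
            out = F.relu(self.bn2(out))
        else:
            # First method: dropout(relu2(bn2(conv1(relu1(bn1(x))))))
            out = F.relu(self.bn2(out))
            out = F.dropout(out, p=self.drop_rate, training=self.training)
        out = self.conv2(out)
        residual = x if self.shortcut is None else self.shortcut(x)
        return out + residual
```

In `eval()` mode dropout is the identity, so with identical weights the two variants compute the same output. They differ only during training: in the second ordering `bn2` computes its batch statistics on activations that have already passed through dropout, while in the first ordering dropout is applied after normalization and ReLU.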