RepDistiller icon indicating copy to clipboard operation
RepDistiller copied to clipboard

How to train teacher model

Open tiancity-NJU opened this issue 4 years ago • 0 comments

When i try to train a teacher model resnet50 on new dataset(cub200), the backbone is different with origin resnet50. one the one hand, it is too big to use big batchsize(8 is ok on 16g gpu), on the other hand, the acc1 is too low when i train 300epoch which is 10% . why?

tiancity-NJU avatar Sep 23 '20 05:09 tiancity-NJU