Nandan Kumar Jha

Results 2 issues of Nandan Kumar Jha

To me, it's confusing, how to set the "shapes" and "out_shapes" when the student is ResNet18 and the teacher is ResNet34 on CIFAR-100. Is it shapes = out_shapes = [1,...

Hi ! Interesting work on the role of explicit bias! I was wondering what training settings got you an eval PPL ~3.04. The paper mentions that 50K iterations are required...