Oscar

Results 3 comments of Oscar

Hi, @aaronsarna , I'm also curious about the linear probing, may I have your settings of final experiment? It seems that BatchNorm before the linear layer dosen't give a good...

@aaronsarna Thanks for your reply. You mean the SimMIM baseline can't provide the performance in the paper without removing the attention on the mask tokens manually?

Okay, thanks a lot for your help