Results: 3 comments by RahulChakwate
That's great! Did you change any training configurations in `config_cls.yaml`? I tried to implement the exact 11-layer architecture mentioned in the paper with the same config settings but could...
@sheshappanavar Yes, I mean L=11 excluding the 3 fc layers. So are you able to reproduce 93.2% using the given L=6 code or did you implement the L=11 code given...
Can someone please explain this? @liuyuyuil did you find a solution?