RahulChakwate

Results 3 comments of RahulChakwate

That great! Did you change any training configurations in `config_cls.yaml`? I tried to implement the exact 11 layer architecture mentioned in the paper with the same config settings but could...

@sheshappanavar Yes, I mean L=11 excluding the 3 fc layers. So are you able to reproduce 93.2% using the given L=6 code or did you implement the L=11 code given...

Can someone please explain this? @liuyuyuil did you find a solution?