pytorch-a2c-ppo-acktr-gail

init of neural network

Open KK666-AI opened this issue 5 years ago • 3 comments

Dear Author,

Thanks for sharing your excellent work on reproducing reinforcement learning algorithms. I noticed that you use init_ = lambda m: init(m, nn.init.orthogonal_, lambda x: nn.init.constant_(x, 0)) to initialize the neural networks, and I find that it makes the networks much more stable. However, I don't understand the theory behind this trick. Could you explain it or point to some related papers?

Thanks.

KK666-AI avatar Mar 03 '20 09:03 KK666-AI
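
For readers following the thread, here is a minimal sketch of what the quoted line does. It is not an answer from the repository author. The `init` helper below is an assumed shape (apply a weight initializer with a gain, apply a bias initializer, return the module); the actual helper in the repository may differ. The layer size, batch size, and seed are arbitrary choices for illustration. The sketch only checks one well-known property: an orthogonally initialized square layer with zero bias preserves the norm of its input. That property is commonly cited as a motivation for orthogonal initialization (the PyTorch documentation for `nn.init.orthogonal_` points to Saxe et al., "Exact solutions to the nonlinear dynamics of learning in deep linear neural networks"), but whether it fully explains the stability reported here is not established in this thread.

```python
import torch
import torch.nn as nn


def init(module, weight_init, bias_init, gain=1.0):
    # Assumed helper shape, not necessarily the repository's exact code:
    # initialize weights (with a gain), then biases, and return the module.
    weight_init(module.weight.data, gain=gain)
    bias_init(module.bias.data)
    return module


# The line quoted in the question: orthogonal weights, zero biases.
init_ = lambda m: init(m, nn.init.orthogonal_, lambda x: nn.init.constant_(x, 0))

torch.manual_seed(0)  # arbitrary seed, only for repeatability
layer = init_(nn.Linear(64, 64))  # hypothetical layer size
W = layer.weight.data

# For a square layer with gain 1, W^T W should equal the identity
# up to floating-point error.
gram_error = (W.T @ W - torch.eye(64)).abs().max().item()

# With zero bias, the layer computes x W^T, so each input vector
# should keep its Euclidean norm.
x = torch.randn(8, 64)
norm_ratio = layer(x).norm(dim=1) / x.norm(dim=1)

print("max |W^T W - I|:", gram_error)
print("output/input norm ratios:", norm_ratio)
```

With a default-initialized `nn.Linear`, the same norm ratio is generally not 1, which is one way to compare the two schemes on your own network before drawing conclusions.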

Hi, I've also noticed that without this init method, A2C doesn't improve when the number of processes is not high enough, and I'm curious about the reason behind it too. I'd be happy to learn more if someone can explain. Thank you.

realiti4 avatar Apr 14 '20 18:04 realiti4

Looking forward to an explanation too!

KarlXing avatar Jul 13 '20 05:07 KarlXing

I am looking forward to an explanation as well!

shtse8 avatar Sep 06 '20 17:09 shtse8