Deep-Reinforcement-Learning-Algorithms-with-PyTorch
Deep-Reinforcement-Learning-Algorithms-with-PyTorch copied to clipboard
Weight Initialization for SAC Discrete
trafficstars
In your paper https://arxiv.org/pdf/1910.07207.pdf, the hyperparameter section states that you have used He weight initialization for the networks, but you seem to be using Xavier initialization for the Actor Critic in SAC discrete.