deep-reinforcement-learning_DDQN_PPO_HER
Dueling networks in mlp_framework.py (dueling double DQN, van Hasselt)
See the dueling_mlp class in mlp_framework.py.
I'm having trouble calculating the correct loss for both streams (advantage and value).
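For context, here is a minimal sketch of how the loss is usually set up for a dueling head combined with double DQN. In the standard dueling formulation the two streams do not get separate losses: their outputs are merged into Q-values, and a single TD loss on the merged Q-value is backpropagated through both streams. This is not the code from mlp_framework.py; the function names (dueling_q, double_dqn_target, td_loss), the array shapes, and the use of plain NumPy with a squared-error loss are all assumptions made for illustration.

```python
import numpy as np


def dueling_q(value, advantage):
    """Merge the two streams into Q-values (hypothetical helper).

    value:     shape (batch, 1), output of the value stream V(s)
    advantage: shape (batch, n_actions), output of the advantage stream A(s, a)
    Subtracting the mean advantage keeps V and A identifiable.
    """
    return value + advantage - advantage.mean(axis=1, keepdims=True)


def double_dqn_target(reward, done, q_next_online, q_next_target, gamma=0.99):
    """Double DQN target: the online net selects the action, the target net evaluates it."""
    rows = np.arange(len(reward))
    best_action = q_next_online.argmax(axis=1)   # action selection (online network)
    next_q = q_next_target[rows, best_action]    # action evaluation (target network)
    return reward + gamma * (1.0 - done) * next_q


def td_loss(q, action, target):
    """One squared TD error on the merged Q-value of the action actually taken."""
    rows = np.arange(len(action))
    q_taken = q[rows, action]
    return np.mean((q_taken - target) ** 2)
```

In an autodiff framework the same three steps apply: compute the loss on the merged Q-values only, treat the target as a constant, and let the gradient flow back into both the value and the advantage stream. A Huber loss is a common substitute for the squared error used here.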