wolpertinger_ddpg
Update train_test.py
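I believe that `episode_steps` and `episode_reward` should be reset to zero after each episode finishes. Otherwise both values keep accumulating across episodes, and the per-episode statistics are wrong.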
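To illustrate the intended behaviour, here is a minimal sketch of a loop that resets both counters at the end of every episode. It is not the actual code in `train_test.py`: `ToyEnv` and `run_episodes` are hypothetical names made up for this example, and the environment is a stand-in that always ends after a fixed number of steps with a reward of 1.0 per step.

```python
class ToyEnv:
    """Hypothetical stand-in environment: every episode lasts a fixed number of steps."""

    def __init__(self, episode_length=5):
        self.episode_length = episode_length
        self._t = 0

    def reset(self):
        self._t = 0
        return 0.0  # dummy observation

    def step(self, action):
        self._t += 1
        reward = 1.0  # constant reward, chosen only to make the totals easy to check
        done = self._t >= self.episode_length
        return float(self._t), reward, done


def run_episodes(env, num_episodes):
    """Run several episodes and record (episode_steps, episode_reward) for each one."""
    episode_steps = 0
    episode_reward = 0.0
    history = []

    for _ in range(num_episodes):
        env.reset()
        done = False
        while not done:
            action = 0  # placeholder action; a real agent would choose one here
            _, reward, done = env.step(action)
            episode_steps += 1
            episode_reward += reward

        history.append((episode_steps, episode_reward))

        # Reset the per-episode counters so the next episode starts from zero.
        episode_steps = 0
        episode_reward = 0.0

    return history


if __name__ == "__main__":
    for steps, total in run_episodes(ToyEnv(episode_length=5), num_episodes=3):
        print(steps, total)
```

Without the two reset lines at the end of the outer loop, the second episode in this sketch would report 10 steps and the third 15, since the counters would carry over from earlier episodes.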