wallcuber

Results 2 issues of wallcuber

Where should I modify the code of train.py? ![image](https://user-images.githubusercontent.com/58249279/75088808-20f68d00-558d-11ea-8f5b-4529bf2802ea.png)

`self.target_critic.load_state_dict(critic_state_dict)` above code seems make target critic network's parameter always be same as the critic network's. So what is the purpose? making the network learn more slowly? Hope somebody help...