xxx-007

Results 2 issues of xxx-007

你好,莫烦老师,我在运行simple_ppo算法中,,根据当前状态选择一个动作 a=self.sess.run(self.sample_op,{self.tfs:s})[0],,选择出来的动作为nan,,我应该如何修改,才能在运行代码过程中不在出现nan值,

hello, I am using vrep for reinforcement learning, and I need a big floor about 4000*4000. I find your floor is big about 2500*2500, and the way you scale‘s floor...