xxx-007

Results: 2 issues of xxx-007

NaN appears in PPO

2 comments

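Hello, Morvan (莫烦). I am running the simple_ppo algorithm. When it selects an action from the current state with a=self.sess.run(self.sample_op,{self.tfs:s})[0], the selected action is NaN. How should I modify the code so that NaN values no longer appear while it runs?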
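One way to narrow down a problem like this is to check where the first non-finite value appears. The sketch below is not taken from simple_ppo; the helper names (find_non_finite, safe_sigma, check_action) and the floor value are hypothetical, and it assumes the state and the action are flat sequences of floats. It only illustrates the checks, on the assumption that the NaN comes from a non-finite observation or from a standard deviation that has collapsed to zero, which are common but not the only possible causes.

```python
import math


def find_non_finite(values):
    """Return the indices of entries that are NaN or infinite."""
    return [i for i, v in enumerate(values) if not math.isfinite(v)]


def safe_sigma(raw_sigma, floor=1e-4):
    """Keep a standard deviation strictly positive.

    The floor of 1e-4 is an illustrative assumption, not a tuned value.
    """
    if not math.isfinite(raw_sigma):
        raise ValueError("sigma is already non-finite: %r" % (raw_sigma,))
    return max(raw_sigma, floor)


def check_action(state, action):
    """Raise a descriptive error if the state or the action is non-finite."""
    bad_state = find_non_finite(state)
    if bad_state:
        raise ValueError("non-finite state entries at %s" % bad_state)
    bad_action = find_non_finite(action)
    if bad_action:
        raise ValueError("non-finite action entries at %s" % bad_action)
    return action
```

In a training loop, a check like check_action(s, a) could be called right after the action is sampled. If the state is finite and the action is not, the network weights or the distribution parameters are the next place to look.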

About the V-REP scene's floor

Hello, I am using V-REP for reinforcement learning, and I need a large floor of about 4000*4000. I see that your floor is about 2500*2500, and the way you scale‘s floor...