Where is "next_state" used in def train(self)?

Open yoshina621 opened this issue 6 years ago • 1 comments

You get next_state from self.get_n_step_info(self.n_step_buffer, self.gamma), but next_state is never used afterwards. Maybe self.store_transition(p, obs, action, reward, next_obs, done) should be self.store_transition(p, obs, action, reward, next_state, done)?

Nov 06 '19 11:11 yoshina621

Oh, sorry, I only just saw this issue. I think you're right, thanks a lot dude. I've corrected the line to reward, next_obs, done = self.get_n_step_info(self.n_step_buffer, self.gamma). You can check it now.

Jan 14 '20 02:01 Huixxi
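
For anyone reading this later, here is a minimal sketch of why the n-step next state matters. It is not the repository's code: the body of get_n_step_info, the (obs, action, reward, next_obs, done) tuple layout, and the toy values are all assumptions made for illustration. It folds the n-step buffer into one transition, so the stored next state is the one reached after n steps (or at early termination) rather than the one-step next_obs.

```python
from collections import deque


def get_n_step_info(n_step_buffer, gamma):
    """Fold a buffer of (obs, action, reward, next_obs, done) tuples
    into one n-step (reward, next_obs, done) triple.

    Hypothetical implementation, assumed for illustration only.
    """
    # Start from the most recent transition in the buffer.
    reward, next_obs, done = n_step_buffer[-1][-3:]
    # Walk backwards, discounting the accumulated return at each step.
    for _, _, r, n_o, d in reversed(list(n_step_buffer)[:-1]):
        reward = r + gamma * reward * (1 - d)
        if d:
            # The episode ended early: cut the return at this step.
            next_obs, done = n_o, d
    return reward, next_obs, done


gamma = 0.5
n_step_buffer = deque(maxlen=3)
n_step_buffer.append(("s0", 0, 1.0, "s1", False))
n_step_buffer.append(("s1", 1, 2.0, "s2", False))
n_step_buffer.append(("s2", 0, 4.0, "s3", False))

# Unpack into the same names that are later stored, so nothing is left unused.
reward, next_obs, done = get_n_step_info(n_step_buffer, gamma)
obs, action = n_step_buffer[0][:2]
transition = (obs, action, reward, next_obs, done)
print(transition)
```

With these toy values the stored transition pairs "s0" with "s3", the state three steps ahead. Storing the one-step next_obs ("s1") together with the 3-step reward would make the bootstrap target inconsistent.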