wjdunham

4 comments by wjdunham

Item 3 is due to a change in the Gym framework: the environment reset() method now adds the "info" dict to its return value, making the return value a tuple. Can be...

There is another issue with the env.step() method, which has also been updated.

Here is the update I used for the env.step() change: ary_state, reward, terminated, _, info = env.step(ary_action). But there is now a new issue: there is no "done" flag...
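For anyone hitting the same missing "done" flag, one possible way to get it back is to combine the two new flags. This is only a sketch, assuming the newer Gym API where env.step() returns five values (state, reward, terminated, truncated, info). The helper name step_compat and the FakeNewStepEnv class are made up for illustration and are not part of Gym or of the original code.

```python
def step_compat(env, action):
    """Call env.step() and always return (state, reward, done, info).

    Handles both the older 4-value return and the newer 5-value return.
    """
    result = env.step(action)
    if len(result) == 5:
        # Newer API: the episode is over if it terminated OR was truncated.
        state, reward, terminated, truncated, info = result
        done = bool(terminated or truncated)
    else:
        # Older API: the "done" flag is returned directly.
        state, reward, done, info = result
    return state, reward, done, info


class FakeNewStepEnv:
    """Hypothetical stand-in that mimics the 5-value step() return."""

    def __init__(self, max_steps=2):
        self.max_steps = max_steps
        self.t = 0

    def step(self, action):
        self.t += 1
        truncated = self.t >= self.max_steps  # time limit reached
        return [0.0], 1.0, False, truncated, {}


if __name__ == "__main__":
    env = FakeNewStepEnv(max_steps=2)
    print(step_compat(env, 0))
    print(step_compat(env, 0))
```

Note that the "_" in the one-line update above discards the truncated flag, so a loop that only checks terminated may keep running past a time limit. Whether that matters depends on the environment.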

This is the necessary update: since Gym env.reset now returns a tuple containing the array and the "info" dict, we need to pull the array out. The old code has: if...
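A rough sketch of pulling the array out in a way that tolerates both return shapes, assuming the newer reset() returns a (state, info) tuple and the older one returns just the state. The helper reset_compat and the two stub classes are hypothetical names used only for this example, not the code from the original project.

```python
def reset_compat(env):
    """Call env.reset() and always return (state, info)."""
    result = env.reset()
    # Newer API: a 2-tuple whose second element is the "info" dict.
    if isinstance(result, tuple) and len(result) == 2 and isinstance(result[1], dict):
        state, info = result
    else:
        # Older API: only the state is returned, so use an empty info dict.
        state, info = result, {}
    return state, info


class FakeOldResetEnv:
    """Hypothetical stand-in for the older reset() that returns only the state."""

    def reset(self):
        return [0.0, 0.0]


class FakeNewResetEnv:
    """Hypothetical stand-in for the newer reset() that returns (state, info)."""

    def reset(self):
        return [0.0, 0.0], {"note": "example"}


if __name__ == "__main__":
    ary_state, info = reset_compat(FakeNewResetEnv())
    print(ary_state, info)
```

If the project only needs to run on the newer Gym, the plain form ary_state, info = env.reset() should be enough and the type check can be dropped.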