mujoco-maze
mujoco-maze copied to clipboard
Maze Ant observation shape
Hello. I am just curious why the observation space of Maze Ant has a shape 30? The original Ant has 111. Is something excluded? If yes, how could I use the full observation space?
Thanks in advance.
- The first 29 elements are Mujoco model's qpos and qvel. I'm sorry but this is inherited from the implementation of HIRO paper (https://github.com/tensorflow/models/blob/master/research/efficient-hrl/environments/ant.py#L81) and I don't know the design choice precisely. Maybe for including the coordinates?
- The last one is timestep. I'm going to make this optional in the future release.
- The first 29 elements are Mujoco model's qpos and qvel. I'm sorry but this is inherited from the implementation of HIRO paper (https://github.com/tensorflow/models/blob/master/research/efficient-hrl/environments/ant.py#L81) and I don't know the design choice precisely. Maybe for including the coordinates?
- The last one is timestep. I'm going to make this optional in the future release.
Thank you for your answer. It's difficult to understand the observation space as everything leads back to the MujocoEnv class (at least from my attempt). Looking forward to your future releases. Cheers!