D4RL-Evaluations
D4RL-Evaluations copied to clipboard
Maze2d tasks don't have a goal location in the state
Hi,
I find it irritating that the observations in the maze2d tasks only contain the 2d positions/velocities. If the agent is not informed about the goal location (which can be found in info/goal
in the data set), it can't decide whether to go eg. left or right as the goal might be on either side.
How was that dealt with in the experiments from the paper? Is the agent conditioned on the goal in some form?
Thanks, -Justin