D4RL
D4RL copied to clipboard
Maze2d rewards seem odd?
Since maze2d-umaze-v1 uses a sparse reward type where a distance less than 0.5 yields a reward of 1, I was wondering how the above is possible. The first timeout field is at 91, yet the rewards leading up to it are all zeros. Would appreciate any thoughts!
Hi did you solve this please?