D4RL
D4RL copied to clipboard
Terminals? How is the data split up into trajectories and why do you make your own terminal finding code?
Quick question - In diffuser for the Maze2D environment why do you make your own terminal finding code given there are already timeouts given in the data?