D4RL
D4RL copied to clipboard
In MuJoCo dataset, how can we figure out the beginning of each episode?
Hi, I would like to use d4rl dataset for DICE scenarios, where sampling from initial states is required. I thought the termination flag could be helpful at first glance, but I've noticed from #34 that termination=False
when an agent reaches the maximum length of episodes.
Is there another recommended method for this issue?
Thanks for your support in advance!
If you're using the MuJoCo datasets, the timeouts flag will be set to true at the end of an episode, so you can use that to find the end of an episode.
You can also use this function to get an iterator through trajectories: https://github.com/rail-berkeley/d4rl/blob/master/d4rl/init.py#L138