Results 2 comments of SangwooShin

Thank you for great works. But I wonder whether the dataset cumulates sequentially. That is, along the implementation of Duane, whether data["observation"][1] indicates the state after we take data["action"][0] in...

Did you understand what is going on here? I'm stuck on the same issue.