Sridhar Thiagarajan

Results 3 comments of Sridhar Thiagarajan

Just load a pre-trained policy, and do env.step(), and save all the states action pairs obtained?

As a side note, in case you're interested in quickly trying something out before this issue gets resolved, I would highly recommend the TD3 author's official implementation (which is in...