Sridhar Thiagarajan
Results
3
comments of
Sridhar Thiagarajan
Just load a pre-trained policy, and do env.step(), and save all the states action pairs obtained?
As a side note, in case you're interested in quickly trying something out before this issue gets resolved, I would highly recommend the TD3 author's official implementation (which is in...
@DanielTakeshi Did you run any of these benchmarks on vision-based tasks, or know of any results?