johnnylin110
Results
2
comments of
johnnylin110
update: I also check the paper from [Benchmarking Deep Reinforcement Learning for Continuous Control](https://arxiv.org/pdf/1604.06778.pdf) explain the Ant environment that  where zbody is the z-coordinate of the body this correspond...
@dkkim93 Thanks for your reply. I have the same observation too, and I was wondering is this just like a local optimal for this case ? Ant can flip over...