pytorchrl
pytorchrl copied to clipboard
TRPO code is not performing well compared to Other implementations
Pearlmutter method only gives "good" value of hessian vector product in the first two iterations in conjugate gradient loop
Some other implementation has much better performance half cheetah. https://arxiv.org/pdf/1708.04133.pdf