pytorchrl icon indicating copy to clipboard operation
pytorchrl copied to clipboard

TRPO code is not performing well compared to Other implementations

Open nosyndicate opened this issue 7 years ago • 1 comments

Pearlmutter method only gives "good" value of hessian vector product in the first two iterations in conjugate gradient loop

nosyndicate avatar Nov 03 '17 14:11 nosyndicate

Some other implementation has much better performance half cheetah. https://arxiv.org/pdf/1708.04133.pdf

nosyndicate avatar Mar 03 '18 06:03 nosyndicate