TRPO-TensorFlow icon indicating copy to clipboard operation
TRPO-TensorFlow copied to clipboard

Trust Region Policy Optimization (TRPO) in pure TensorFlow

Results 1 TRPO-TensorFlow issues
Sort by recently updated
recently updated
newest added

Hey, I was running an implementation of your code, and it seems like the kl_pen is always zero. It seems like its because the oldlog_vars and log_vars are the same....