TRPO-TensorFlow
TRPO-TensorFlow copied to clipboard
Trust Region Policy Optimization (TRPO) in pure TensorFlow
Results
1
TRPO-TensorFlow issues
Sort by
recently updated
recently updated
newest added
Hey, I was running an implementation of your code, and it seems like the kl_pen is always zero. It seems like its because the oldlog_vars and log_vars are the same....