modular_rl

Implementation of TRPO and related algorithms

Results 11 modular_rl issues

The TRPO paper (Appendix C) claims that "we use a line search to ensure improvement of the surrogate objective and satisfaction of the KL divergence constraint". However, in the current...
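The line search quoted from the paper can be sketched as a backtracking loop that accepts a step only when both conditions hold. This is a minimal illustration, not the repository's code: `backtracking_line_search`, `surrogate`, `kl`, `max_kl`, and the toy objective are all hypothetical names and values chosen for the example.

```python
def backtracking_line_search(surrogate, kl, theta, full_step, max_kl,
                             max_backtracks=10, shrink=0.5):
    """Shrink the step until the surrogate improves AND the KL bound holds.

    Hypothetical sketch: `surrogate(theta)` is the objective to maximize and
    `kl(old, new)` is the divergence between the old and candidate policies.
    """
    base = surrogate(theta)
    for i in range(max_backtracks):
        frac = shrink ** i
        candidate = [t + frac * s for t, s in zip(theta, full_step)]
        improved = surrogate(candidate) > base
        within_trust_region = kl(theta, candidate) <= max_kl
        if improved and within_trust_region:
            return True, candidate
    # No acceptable step found: keep the old parameters.
    return False, list(theta)


# Toy problem (assumed for illustration): quadratic surrogate, quadratic "KL".
def toy_surrogate(theta):
    return -(theta[0] - 1.0) ** 2


def toy_kl(old, new):
    return 0.5 * sum((n - o) ** 2 for o, n in zip(old, new))


ok, new_theta = backtracking_line_search(
    toy_surrogate, toy_kl, theta=[0.0], full_step=[3.0], max_kl=0.5)
print(ok, new_theta)
```

In this toy run the full step overshoots the optimum, the half step improves the surrogate but violates the KL bound, and the quarter step is the first to satisfy both checks.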

Hi there, I am trying to use your code to train on a new pybullet environment. Here is my pip log: Keras (2.0.2) Markdown (2.6.11) mock (2.0.0) numpy (1.13.3) pbr (3.1.1)...

Currently, using Python 3 gives:
```
  File "run_pg.py", line 44
    print "*********** Iteration %i ****************" % COUNTER
    ^
SyntaxError: Missing parentheses in call to 'print'
```
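The error comes from the Python 2 `print` statement, which Python 3 replaced with a function. A minimal sketch of a form that runs on both versions follows; the value of `COUNTER` is assumed here for illustration, and the rest of `run_pg.py` may need further Python 3 changes that this does not cover.

```python
from __future__ import print_function  # no-op on Python 3, enables print() on Python 2

COUNTER = 1  # hypothetical iteration count, for illustration only

# Same format string as in the traceback, now passed to the print function.
line = "*********** Iteration %i ****************" % COUNTER
print(line)
```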

If I add a dropout layer to the model, would that be a bad idea? Have there been any experiments on this?

Hi John, I have read your TRPO paper and I'm trying to reproduce the Fisher-vector product calculation function in C. Lines 36-37 in agentzoo.py confuse me. I copy the...

Hello John, After reading your paper on TRPO and viewing your code on GitHub, I am a little confused about the steps regarding the prediction of value functions. Here, you...

Hi there, I'm trying to reproduce the results. But when running the code, I first ran into the Monitor error, which is caused by updates to the gym environments. And...

Hi John, I'm trying to apply TRPO to a robotics control task using vision, but I constantly hit a GPU memory overhead in the fit method of class NnRegression during baseline calculation. On...

I am getting the following warning when I try to save the weights. Here I am loading the weights from a previously trained model. > {'warnflag': 1, 'task': 'STOP: TOTAL...

Hi, Can TRPO be used with RNNs, and with LSTM or GRU in particular? It could be useful for partially observed locomotion tasks. And what modifications are required for adding...