baselines issues

Question about deepcopy to environments (SubprocVecEnv)

1

Dear author, thank you for providing these useful baselines. They are very useful for my research. But now I have a question about applying deepcopy to SubprocVecEnv. In my code, I...

baifanxxx

Question on mujoco reward

When training ppo2 in a MuJoCo environment, I find that the episode reward obtained from infos['episode']['r'] does not equal the sum of the rewards of each step. In the Humanoid environment, summing up...

SeanZChen

Module 'tensorflow' has no attribute 'set_random_seed' error when running baselines

1

When I tried to run the [PPO2 baseline](https://colab.research.google.com/drive/1rU20zJ281sZuMD1DHbsODFr1DbASL0RH#scrollTo=f3AsF_nuTpOj), I encountered this error: `Module 'tensorflow' has no attribute 'set_random_seed'`. As I dug deeper, I realized that in TF2 this function...

erfanMhi
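For the `set_random_seed` error above, one possible workaround is to look up the seeding function at runtime instead of hard-coding the TF1 name. The sketch below is not taken from baselines: the helper name `resolve_seed_fn` is hypothetical, and it assumes only that TF2 exposes `tf.random.set_seed` while TF1 exposes `tf.set_random_seed`. The demo uses stand-in objects so it runs without TensorFlow installed.

```python
from types import SimpleNamespace


def resolve_seed_fn(tf_module):
    """Return a callable that seeds the global TensorFlow RNG.

    Hypothetical helper: prefers the TF2 name and falls back to the TF1 name.
    """
    random_ns = getattr(tf_module, "random", None)
    # TF2 style: tf.random.set_seed(seed)
    if random_ns is not None and hasattr(random_ns, "set_seed"):
        return random_ns.set_seed
    # TF1 style: tf.set_random_seed(seed)
    if hasattr(tf_module, "set_random_seed"):
        return tf_module.set_random_seed
    raise AttributeError("no known seeding function on this module")


# Demo with stand-in modules (assumption: these only mimic the attribute layout).
calls = []
fake_tf2 = SimpleNamespace(
    random=SimpleNamespace(set_seed=lambda s: calls.append(("tf2", s)))
)
fake_tf1 = SimpleNamespace(set_random_seed=lambda s: calls.append(("tf1", s)))

resolve_seed_fn(fake_tf2)(30)
resolve_seed_fn(fake_tf1)(7)
```

With a real installation, the call would be `resolve_seed_fn(tf)(seed)` after `import tensorflow as tf`.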

policy entropy in PPO2

1

Hi, when applying PPO2 to a custom MuJoCo environment, the policy entropy increases continuously, even with a small entropy coefficient of 0.01 or less. In my understanding, ideally...

akhilsanand

AttributeError: 'EnvSpec' object has no attribute '_entry_point'

10

In /home/yxh/anaconda3/envs/tensorenv/lib/python3.6/site-packages/baselines/baselines/run.py, change `env._entry_point.split(':')[0].split('.')[-1]` to `env.entry_point.split(':')[0].split('.')[-1]` in 2 places.

PolarisYxh
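The rename described above can also be handled without editing the file differently per Gym version. The following is a minimal sketch, assuming the spec object carries either a public `entry_point` or a private `_entry_point` string attribute; the helper names `get_entry_point` and `env_type_from_spec` are hypothetical and not part of baselines or Gym, and the demo specs are stand-ins.

```python
from types import SimpleNamespace


def get_entry_point(spec):
    """Read the entry point from a spec, whichever attribute name it uses."""
    # Newer specs: public attribute.
    entry_point = getattr(spec, "entry_point", None)
    if entry_point is None:
        # Older specs: private attribute (raises AttributeError if missing too).
        entry_point = getattr(spec, "_entry_point")
    return entry_point


def env_type_from_spec(spec):
    """Apply the same split as the expression quoted in the issue."""
    # Assumption: the entry point is a string like "package.module:ClassName".
    return get_entry_point(spec).split(":")[0].split(".")[-1]


# Stand-in specs that only mimic the attribute layout.
new_style = SimpleNamespace(entry_point="gym.envs.mujoco:Walker2dEnv")
old_style = SimpleNamespace(_entry_point="gym.envs.atari:AtariEnv")

new_type = env_type_from_spec(new_style)
old_type = env_type_from_spec(old_style)
```

The fallback keeps a single code path for both attribute names; it does not cover specs whose entry point is a callable rather than a string.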

Trained model not working

5

I have trained the PPO2 model on the Walker2d-v2 environment with nminibatches=64, using the following command: `python -m baselines.run --alg=ppo2 --env=Walker2d-v2 --num_timesteps=1e6 --seed=30 --network=mlp --num_env=1 --save_path="/home/surabhi/Downloads/github/baselines/result/walker2d/30/ppo2" --log_path="/home/surabhi/Downloads/github/baselines/result/walker2d/30/"` ![30](https://user-images.githubusercontent.com/51375621/70596643-65b88c00-1c0c-11ea-92e1-95cdaaa1af76.png) But when I run...

surbhi1944

Updated acktr.py

Used the `random` keyword instead of the `set` keyword for TensorFlow 2.

ushukkla

TypeError: mlp() got an unexpected keyword argument 'value_network'

6

After installing the tf2 version, I tried to run the check command from the README and got the error above: `python -m baselines.run --alg=ppo2 --env=Humanoid-v2 --network=mlp --num_timesteps=2e7` Logging to /tmp/openai-2019-10-30-11-49-36-171979...

JiyueWang

tf2

Add kwarg to pass custom 'worker' to SubprocVecEnv

Signed-off-by: Fabrice Normandin

lebrice

AssertionError: Do not use tf.reset_default_graph() to clear nested graphs. If you need a cleared graph, exit the nesting and create a new graph.

2

Hello, I am trying to write a loop to test the training performance of a DQN agent, which needs to load the model multiple times and reset the environment and tensorflow...

SrAlexBay
