tensorflow-rl issues

PseudoCountQLearner

In CTS-DQN, why do we update the CTS model using the next frame rather than the same frame used for action selection? Refer to https://github.com/steveKapturowski/tensorflow-rl/blob/master/algorithms/intrinsic_motivation_actor_learner.py#L417

xfdywy
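
For context on the question above, here is a minimal sketch of how a pseudo-count and exploration bonus can be derived from a density model's probability of a frame before and after the model is updated on that frame. It follows the pseudo-count formula from the count-based exploration literature. The class `EmpiricalDensity`, the functions `pseudo_count` and `exploration_bonus`, and the `beta` value are all hypothetical names chosen for illustration, and a simple frequency table stands in for CTS. This is not the repository's code and does not settle which frame should be passed in.

```python
import math
from collections import Counter


class EmpiricalDensity:
    """Toy stand-in for CTS (assumption): probability = visit frequency."""

    def __init__(self):
        self.counts = Counter()
        self.total = 0

    def prob(self, frame):
        # Probability assigned to `frame` before any further update.
        return self.counts[frame] / self.total if self.total else 0.0

    def update(self, frame):
        self.counts[frame] += 1
        self.total += 1


def pseudo_count(model, frame):
    """Update the model on `frame` and return its pseudo-count.

    rho is the probability before the update, rho_prime the probability
    after it; N = rho * (1 - rho_prime) / (rho_prime - rho).
    """
    rho = model.prob(frame)
    model.update(frame)
    rho_prime = model.prob(frame)
    gain = rho_prime - rho
    if gain <= 0.0:
        # No learning progress on this frame: treat it as fully known.
        return float("inf")
    return rho * (1.0 - rho_prime) / gain


def exploration_bonus(count, beta=0.05):
    # beta is an illustrative value, not taken from the repository.
    return beta / math.sqrt(count + 0.01)


model = EmpiricalDensity()
for frame in ["a", "a", "a", "b"]:
    model.update(frame)

seen = pseudo_count(model, "a")    # frame visited three times so far
novel = pseudo_count(model, "c")   # frame never visited
```

With this frequency-table model the pseudo-count reduces to the true visit count, so a frequently seen frame earns a smaller bonus than a novel one.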

Training slowing down dramatically

1

Has anyone else seen the training process slow down over time? For example, training one DQN-CTS worker on Montezuma's Revenge runs at about 220 iter/sec after 100,000 steps and 35...

ionelhosu

Reproducing your stellar result on Montezuma's Revenge

9

Hi Steve, I am trying to reproduce the ~3600 score you achieved on Montezuma's Revenge with your dqn-cts model (as shown in the GIF in the README). With 30M steps counting,...

dhfromkorea

Tensorboard summaries not appearing

1

No scalar data was found. How did you get the code to work with TensorBoard? The summary part does not seem to be working during training.

python main.py Pong-v0...

mabirck

Segmentation fault with fast_cts.pyx

7

Hi Steve, I have another newbie question. I am having a hard time running the fast_cts module. I think I successfully compiled the Cython module, but when using...

dhfromkorea

About actor_learner.py

Look at the following code:

```
with self.monitored_environment(), session_context as self.session:
    self.synchronize_workers()
    if self.is_train:
        self.train()
    else:
        self.test()
```

After trying several times, I felt the "with ... as" will _**exit**_...

lezhang-thu
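
As background for the question above, here is a minimal, self-contained sketch of how Python handles a `with` statement that lists two context managers. The helper `tracked` and the function `run` are hypothetical names used only for illustration and are unrelated to the repository's `monitored_environment` or `session_context`. The sketch shows two general language facts: `as` binds only the value of the manager it follows, and both managers exit, in reverse order, only when the indented block finishes.

```python
from contextlib import contextmanager


@contextmanager
def tracked(name, events):
    """Hypothetical context manager that logs when it is entered and exited."""
    events.append(f"enter {name}")
    try:
        yield name
    finally:
        events.append(f"exit {name}")


def run():
    events = []
    # `as session` binds only the second manager's value.
    with tracked("monitor", events), tracked("session", events) as session:
        events.append(f"body sees {session}")
    # Both managers have exited by this point, innermost first.
    events.append("after with")
    return events


log = run()
```

Here `log` records both enters, then the body, then the exits in reverse order, and only then the statement after the block.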

Distributed training

steveKapturowski

Can't test CartPole-v0 model trained with TRPO

3

I've tried to use TRPO to create a model for `CartPole-v0` by following the instructions on your [OpenAI Gym page](https://gym.openai.com/evaluations/eval_4QXCRAATTDqakJV0YZlJ4g#reproducibility), changing the command to the following to reflect the API...

captify-alapite

Code hardwired to Atari and emulator environments

1

In many places in the code, there's a hardwired expectation that the environment will either be from ALE, or else it will be from some other type of video game....

captify-alapite
