tensorflow-rl icon indicating copy to clipboard operation
tensorflow-rl copied to clipboard

Implementations of deep RL papers and random experimentation

Results 9 tensorflow-rl issues
Sort by recently updated
recently updated
newest added

In CTS-DQN, why we update the CTS model by using the next frame but not the same frame as used by action selection? refer to https://github.com/steveKapturowski/tensorflow-rl/blob/master/algorithms/intrinsic_motivation_actor_learner.py#L417

Did anyone face the issue of the training process slowing down? For example, training one DQN-CTS worker on Montezuma's Revenge runs at about 220 iter/sec after 100.000 steps and 35...

Hi Steve, I am trying to reproduce the ~3600 score you achieved on Montezuma's Revenge with your dqn-cts model (as per the gif image on README). With 30M steps counting,...

No scalar data was found. How did you guys get the code to work with tensorboard? the summary part seems to be not working when training. ....................................................................... python main.py Pong-v0...

Hi Steve, I feel like asking you another newbie question. I am having a hard time running fast_cts module. I think I successfully compiled the cython module, but when using...

Look at the following code: ``` with self.monitored_environment(), session_context as self.session: self.synchronize_workers() if self.is_train: self.train() else: self.test() ``` After trying several times, I felt the "with ... as" will _**exit**_...

I've tried to use TRPO to create a model for `CartPole-v0` by following the instructions on your [OpenAI Gym page](https://gym.openai.com/evaluations/eval_4QXCRAATTDqakJV0YZlJ4g#reproducibility), changing the command to the following to reflect the api...

In many places in the code, there's a hardwired expectation that the environment will either be from ALE, or else it will be from some other type of video game....