batch_rl
batch_rl copied to clipboard
Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games
Hi, I tried reproducing the offline REM results with Asterix/1 dataset by using the command below: ``` python -um batch_rl.fixed_replay.train \ --base_dir=/tmp/batch_rl \ --replay_dir=/data_large/readonly/atari/Asterix/1 \ --agent_name=multi_head_dqn \ --gin_files='batch_rl/fixed_replay/configs/rem.gin' \ --gin_bindings='FixedReplayRunner.num_iterations=1000'...
Hi, contributing this example of how to read the atari files directly, in case anyone wants to do that. Note that the data is stored in the same temporal sequence...
I was able to train dqn agents using off-line data. I wonder how to evaluate agents ? e.g. reproduce the figure for Pong in Figure 3?
Hi, I am new to this repo and offline setting for RL. I guess it should be possible, but still would like to hear some suggestions from the pros. More...
To facilitate comparison with a method we are developing, is it possible to release raw results (e.g. similar to [dopamine json files](https://github.com/google/dopamine/tree/master/baselines/data)?) These data already "exist" as part of your...
I have been trying to train an online agent on the environment FreewayNoFrameskip-v4. Because this gym environment is not deterministic, I seeded the environment. Specifically, in [atari_lib.py](https://github.com/google/dopamine/blob/master/dopamine/discrete_domains/atari_lib.py), I added -...
Hi, I have read your paper which was published on ICML 2020, now I try to do some research on the offline image data. I have noticed that when training...
The code cant work in my environment where TF version is the newest(2.2.0). because the tf.contrib moudle has been removed?
JAX code
Hi, I would like to ask whether there is a jax-based code. And whether there are some recommendations about jax-based offline rl algorithms. Thanks!
I am on Windows, and run the basic test `python -um batch_rl.tests.atari_init_test`. But it failed. Here is the traceback: Running tests under Python 3.9.12: C:\Users\cenyyang\Anaconda3\python.exe [ RUN ] AtariInitTest.test_atari_init INFO:tensorflow:Saving...