batch_rl icon indicating copy to clipboard operation
batch_rl copied to clipboard

Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games

Results 16 batch_rl issues
Sort by recently updated
recently updated
newest added

Hi, I tried reproducing the offline REM results with Asterix/1 dataset by using the command below: ``` python -um batch_rl.fixed_replay.train \ --base_dir=/tmp/batch_rl \ --replay_dir=/data_large/readonly/atari/Asterix/1 \ --agent_name=multi_head_dqn \ --gin_files='batch_rl/fixed_replay/configs/rem.gin' \ --gin_bindings='FixedReplayRunner.num_iterations=1000'...

Hi, contributing this example of how to read the atari files directly, in case anyone wants to do that. Note that the data is stored in the same temporal sequence...

documentation
enhancement

I was able to train dqn agents using off-line data. I wonder how to evaluate agents ? e.g. reproduce the figure for Pong in Figure 3?

Hi, I am new to this repo and offline setting for RL. I guess it should be possible, but still would like to hear some suggestions from the pros. More...

To facilitate comparison with a method we are developing, is it possible to release raw results (e.g. similar to [dopamine json files](https://github.com/google/dopamine/tree/master/baselines/data)?) These data already "exist" as part of your...

enhancement

I have been trying to train an online agent on the environment FreewayNoFrameskip-v4. Because this gym environment is not deterministic, I seeded the environment. Specifically, in [atari_lib.py](https://github.com/google/dopamine/blob/master/dopamine/discrete_domains/atari_lib.py), I added -...

Hi, I have read your paper which was published on ICML 2020, now I try to do some research on the offline image data. I have noticed that when training...

documentation
good first issue
question

The code cant work in my environment where TF version is the newest(2.2.0). because the tf.contrib moudle has been removed?

good first issue

Hi, I would like to ask whether there is a jax-based code. And whether there are some recommendations about jax-based offline rl algorithms. Thanks!

enhancement
good first issue

I am on Windows, and run the basic test `python -um batch_rl.tests.atari_init_test`. But it failed. Here is the traceback: Running tests under Python 3.9.12: C:\Users\cenyyang\Anaconda3\python.exe [ RUN ] AtariInitTest.test_atari_init INFO:tensorflow:Saving...