batch_rl issues

Asterix/1 dataset broken?

3

Hi, I tried reproducing the offline REM results with Asterix/1 dataset by using the command below: ``` python -um batch_rl.fixed_replay.train \ --base_dir=/tmp/batch_rl \ --replay_dir=/data_large/readonly/atari/Asterix/1 \ --agent_name=multi_head_dqn \ --gin_files='batch_rl/fixed_replay/configs/rem.gin' \ --gin_bindings='FixedReplayRunner.num_iterations=1000'...

dssrgu

Reading atari files directly.

3

Hi, contributing this example of how to read the atari files directly, in case anyone wants to do that. Note that the data is stored in the same temporal sequence...

DuaneNielsen

documentation

enhancement

How to evaluate agents?

2

I was able to train dqn agents using off-line data. I wonder how to evaluate agents ? e.g. reproduce the figure for Pong in Figure 3?

Altriaex

Can a customized env be added to the current framework?

1

Hi, I am new to this repo and offline setting for RL. I guess it should be possible, but still would like to hear some suggestions from the pros. More...

blurLake

Raw results

8

To facilitate comparison with a method we are developing, is it possible to release raw results (e.g. similar to [dopamine json files](https://github.com/google/dopamine/tree/master/baselines/data)?) These data already "exist" as part of your...

n17s

enhancement

Getting 7 as action for a game with 3 actions

1

I have been trying to train an online agent on the environment FreewayNoFrameskip-v4. Because this gym environment is not deterministic, I seeded the environment. Specifically, in [atari_lib.py](https://github.com/google/dopamine/blob/master/dopamine/discrete_domains/atari_lib.py), I added -...

arjung128

How to train offline agent on the huge dataset (50 Million) ?

3

Hi, I have read your paper which was published on ICML 2020, now I try to do some research on the offline image data. I have noticed that when training...

LQNew

documentation

good first issue

question

TF Version must < 2.0?

4

The code cant work in my environment where TF version is the newest(2.2.0). because the tf.contrib moudle has been removed?

weihongwei0586

good first issue

JAX code

11

Hi, I would like to ask whether there is a jax-based code. And whether there are some recommendations about jax-based offline rl algorithms. Thanks!

lucasliunju

enhancement

good first issue

Windows: basic test failed

1

I am on Windows, and run the basic test `python -um batch_rl.tests.atari_init_test`. But it failed. Here is the traceback: Running tests under Python 3.9.12: C:\Users\cenyyang\Anaconda3\python.exe [ RUN ] AtariInitTest.test_atari_init INFO:tensorflow:Saving...

yceny

batch_rl
batch_rl copied to clipboard

Metadata

Asterix/1 dataset broken?

Reading atari files directly.

How to evaluate agents?

Can a customized env be added to the current framework?

Raw results

Getting 7 as action for a game with 3 actions

How to train offline agent on the huge dataset (50 Million) ?

TF Version must < 2.0?

JAX code

Windows: basic test failed

← Metadata

Owner

Metadata

batch_rl batch_rl copied to clipboard

Metadata

← Metadata

Owner

Metadata

batch_rl
batch_rl copied to clipboard