EfficientZero issues

Code for continuous action space

1

Hi, thanks for the repository. Could you consider releasing the code that supports continuous actions spaces for the DMControl 100k benchmark please ? The code that uses the discretization of...

davidva1

ray warning

1

(scheduler +2m13s) Warning: The following resource request cannot be scheduled right now: {'GPU': 0.125, 'CPU': 0.5}. This is likely due to all cluster resources being claimed by actors. Consider creating...

QiGuLongDongQiang

Question about the test phase not always running fully

1

I always can't run the test phase completely, most of the training stops running when the test phase reaches about 3%-7%, and there is no error reported. Could you please...

QiGuLongDongQiang

Question about the effect of discount factor and done mask when calculating the target value?

Thanks for your open-sourced code very much. This is a common definition of an target value in classical RL: I'm a little confused about the way of calculating target value...

puyuan1996

How to use with SLURM

Any guidance for using with SLURM? Certain actors are failing When I run `srun -p compsci-gpu --gres=gpu:4 --cpus-per-gpu=5 --mem=24G --pty bash` Followed by: `python main.py --env BreakoutNoFrameskip-v4 --case atari --opr...

dillonmsandhu

WSL2 NVIDIA 3090 or M1 MBP correct environment

Hi, I had trouble identifying the right mix of python and packages to get this to run. Could you please review/confirm the python version and requirements.txt for either one of...

atalapan

Question about getting zero test score when I try to run EfficientZero on BabyAI grid environment

2

Hello, first of all thanks for your amazing job on EfficientZero. I tried to adapt EfficientZero on BabyAI environment like: "PutNextLocal", but it just keep give me 0 test score...

jiachengc

Question about the effect of state encoding indentity connection in dynamics network

1

Thanks for your open-sourced code very much. I'm a little confused about the reason for the identity connection of state encoding in [DynamicsNetwork](https://github.com/YeWR/EfficientZero/blob/main/config/atari/model.py#L252) in model.py: Why do we add this...

puyuan1996

Question about whether need to train multiple agents for different games

1

Thanks for you open-sourced code very much. Recently, I want to apply the model used for breakout to other games, but I find that different games have different action Spaces,...

QiGuLongDongQiang

Question about the index of pad_child_visits_lst in selfplay_worker.py

2

Thanks for you open-sourced code very much. I am very confused about this code segment in [put_last_trajectory](https://github.com/YeWR/EfficientZero/blob/main/core/selfplay_worker.py#L69) method in selfplay_worker.py: In [Line 69](https://github.com/YeWR/EfficientZero/blob/main/core/selfplay_worker.py#L69) , why is, ` pad_child_visits_lst = game_histories[i].child_visits[beg_index:end_index]`...

puyuan1996

EfficientZero
EfficientZero copied to clipboard

Metadata

Code for continuous action space

ray warning

Question about the test phase not always running fully

Question about the effect of discount factor and done mask when calculating the target value?

How to use with SLURM

WSL2 NVIDIA 3090 or M1 MBP correct environment

Question about getting zero test score when I try to run EfficientZero on BabyAI grid environment

Question about the effect of state encoding indentity connection in dynamics network

Question about whether need to train multiple agents for different games

Question about the index of pad_child_visits_lst in selfplay_worker.py

← Metadata

Owner

Metadata

EfficientZero EfficientZero copied to clipboard

Metadata

← Metadata

Owner

Metadata

EfficientZero
EfficientZero copied to clipboard