EfficientZero
EfficientZero copied to clipboard
Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.
Hi, thanks for the repository. Could you consider releasing the code that supports continuous actions spaces for the DMControl 100k benchmark please ? The code that uses the discretization of...
(scheduler +2m13s) Warning: The following resource request cannot be scheduled right now: {'GPU': 0.125, 'CPU': 0.5}. This is likely due to all cluster resources being claimed by actors. Consider creating...
I always can't run the test phase completely, most of the training stops running when the test phase reaches about 3%-7%, and there is no error reported. Could you please...
Thanks for your open-sourced code very much. This is a common definition of an target value in classical RL: I'm a little confused about the way of calculating target value...
Any guidance for using with SLURM? Certain actors are failing When I run `srun -p compsci-gpu --gres=gpu:4 --cpus-per-gpu=5 --mem=24G --pty bash` Followed by: `python main.py --env BreakoutNoFrameskip-v4 --case atari --opr...
Hi, I had trouble identifying the right mix of python and packages to get this to run. Could you please review/confirm the python version and requirements.txt for either one of...
Hello, first of all thanks for your amazing job on EfficientZero. I tried to adapt EfficientZero on BabyAI environment like: "PutNextLocal", but it just keep give me 0 test score...
Thanks for your open-sourced code very much. I'm a little confused about the reason for the identity connection of state encoding in [DynamicsNetwork](https://github.com/YeWR/EfficientZero/blob/main/config/atari/model.py#L252) in model.py: Why do we add this...
Thanks for you open-sourced code very much. Recently, I want to apply the model used for breakout to other games, but I find that different games have different action Spaces,...
Thanks for you open-sourced code very much. I am very confused about this code segment in [put_last_trajectory](https://github.com/YeWR/EfficientZero/blob/main/core/selfplay_worker.py#L69) method in selfplay_worker.py: In [Line 69](https://github.com/YeWR/EfficientZero/blob/main/core/selfplay_worker.py#L69) , why is, ` pad_child_visits_lst = game_histories[i].child_visits[beg_index:end_index]`...