EfficientZero icon indicating copy to clipboard operation
EfficientZero copied to clipboard

Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.

Results 33 EfficientZero issues
Sort by recently updated
recently updated
newest added

I found some BatchNorm(BN) ops are used in config/atari/model.py, but BN is generally not used for RL. So I do some ablations with and without BN. The following results show...

I wonder if there is any tutorial how to add own custom gym environment to use EffficientZero algorithm ? Where the model is saved after training? How to use saved...

Hello! Thank you for this open source implementation and your great research! state_norm on this line: https://github.com/YeWR/EfficientZero/blob/main/core/config.py#L57 Do you have any insights into whether normalizing the hidden state helps training...

Hey, is there a way to use your implementation with a fixed MDP dataset instead of an environment for 100% offline RL?

./cnode.h:47:42: error: a space is required between consecutive right angle brackets (use '> >') std::vector node_pools;

First of all, congratulations on the great work! I've been trying to train an agent to play breakout and the training is really slow. This is really confusing to me...

Hi, I found something wired when training EfficientZero. I trained the agent on a P40 sever which has 4 24G GPUs and 28 CPUs. But all the computed memory was...

Thanks for sharing this codebase, I was wondering if you were planning to support PyTorch Lightning in the future ?

Thank you authors for the awesome paper! I have an issue reproducing the results of Breakout. Instead of 414 claimed in the paper, I get 362.43 (average mean performance over...