LightZero
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios
I was looking at the implementation of the minigrid environment as inspiration to create my own environment. I noticed that the corresponding efficient muzero config doesn't actually use the environment....
Suppose there is a game on a 10-by-10 grid: each position holds a piece of gold with a random positive value, and an agent does the mining...
Hello, I want to replicate all E0 Atari results to compare my custom learner against. I have SSH access to 32 CPU core + 4 GPU nodes on a cluster....
- Our work is currently in progress, focusing on developing an efficient version of MuZero. - The code is submitted for discussion.
The tutorial for creating a custom environment describes how to add the environment. However, it doesn't mention how to start the RL algorithms on it. Is there some guideline I...
- Our work is currently focused on developing a unified and scalable planning framework. - Our code is partially based on https://github.com/eloialonso/iris.
AlphaZero, MuZero, and Gumbel MuZero in the game of Go
I attempted to replicate the Sampled EfficientZero results shown for the Hopper-v3 environment in the README's benchmark section, using the default configuration file (zoo/mujoco/config/mujoco_sampled_efficientzero_config.py). However, I encountered two main issues during the process:...