muzero-general issues

Results 58 muzero-general issues


MuZero Unplugged

Hey, I'm wondering whether there is any plan to extend the codebase to MuZero Unplugged so that it works in an offline RL setting?

tbskrpmnns

enhancement

question

Why is root.visit_count initialized to 0 and root_predicted_value not included in root node value?

The MCTS implementation here works roughly like this (pseudocode):

```python
def mcts(observation):
    root_predicted_value, stuff = model.initial_inference(observation)
    root = Node()
    root.expand(stuff)
    root.add_exploration_noise()
    for _ in range(num_simulations):
        leaf = find_unexpanded_leaf()  # here...
```

dniku

enhancement
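To make the root-value question above concrete, here is a minimal sketch of the two bookkeeping choices. It is not the repository's code: the `Node` class, `backup`, and `root_value` are hypothetical names, and the sketch ignores rewards, discounting, and the sign flip used in two-player games. It only shows how the root's averaged value and visit count change when the network's `root_predicted_value` is counted as one extra backup.

```python
from dataclasses import dataclass


@dataclass
class Node:
    """Minimal node statistics (hypothetical, not the repository's class)."""
    visit_count: int = 0
    value_sum: float = 0.0

    def value(self) -> float:
        # Mean of all values backed up through this node; 0.0 if never visited.
        return self.value_sum / self.visit_count if self.visit_count else 0.0

    def backup(self, value: float) -> None:
        # Record one more value estimate for this node.
        self.visit_count += 1
        self.value_sum += value


def root_value(root_predicted_value, simulation_values, include_prediction):
    """Average simulation values at the root, optionally seeding the
    statistics with the network's own value estimate for the root."""
    root = Node()  # visit_count starts at 0
    if include_prediction:
        root.backup(root_predicted_value)
    for v in simulation_values:
        root.backup(v)
    return root.value(), root.visit_count


sims = [0.0, 0.0, 1.0, 1.0]  # made-up leaf values from four simulations
print(root_value(1.0, sims, include_prediction=False))
print(root_value(1.0, sims, include_prediction=True))
```

With the prediction excluded, the root ends with four visits and a value that is the plain mean of the simulations; with it included, the root has five visits and the mean is pulled toward the network's estimate.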

procgen

Sorry, I'm a newbie. How would I implement this so that it runs on a procgen env? Thank you.

hlsfin

enhancement

question

2 players moving simultaneously

Hello, every 2-player game implemented is turn-based. Do you mind providing an example or advising on how to make a game where both players move simultaneously? Also,...

omgmax

question
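One possible way to approach the simultaneous-move question above is to serialize each joint move into two turn-based steps: the first player's action is buffered and kept out of the observation, and both actions are resolved only once the second player has acted. The sketch below uses rock-paper-scissors. The class `SimultaneousRPS` and its `reset`/`step`/`observation` interface are assumptions made for illustration, not the repository's game API, and whether a learned model copes well with the hidden buffered action is not something this sketch addresses.

```python
class SimultaneousRPS:
    """Rock-paper-scissors exposed as a turn-based game (hypothetical example)."""

    ACTIONS = (0, 1, 2)  # 0 = rock, 1 = paper, 2 = scissors

    def __init__(self):
        self.reset()

    def reset(self):
        self.to_play = 0     # player 0 commits first
        self.pending = None  # player 0's buffered, not-yet-applied action
        return self.observation()

    def observation(self):
        # Only the player to move is visible; the buffered action stays
        # hidden so that player 1 cannot react to it.
        return [self.to_play]

    def step(self, action):
        """Return (observation, reward_for_acting_player, done)."""
        if self.to_play == 0:
            self.pending = action  # buffer the move, do not resolve yet
            self.to_play = 1
            return self.observation(), 0, False

        # Player 1 has chosen: resolve both actions at once.
        first, second = self.pending, action
        outcome = (second - first) % 3  # 1: second wins, 2: first wins, 0: draw
        reward = {0: 0, 1: 1, 2: -1}[outcome]  # from player 1's point of view
        self.pending = None
        self.to_play = 0
        return self.observation(), reward, True


env = SimultaneousRPS()
env.reset()
env.step(0)                       # player 0 secretly plays rock
obs, reward, done = env.step(1)   # player 1 plays paper
print(obs, reward, done)
```

The design choice here is that no reward is emitted on the first half-step, so the outcome is attributed to the joint move rather than to whichever player happened to act last.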

If I know the environment, is it better to train AlphaZero?

If I have access to the environment model, is it faster/better to train AlphaZero instead? Thanks.

omgmax

question

Self-play very slow and inefficient on GPU (self.selfplay_on_gpu = True)

Hello, I've been having issues with doing self-play on GPU, and after about a week of experimentation I've realized that it is necessary to use this option if I want...

kevaday

enhancement

File not found ray distributed cluster worker node save checkpoint

ray.exceptions.RayActorError: The actor died unexpectedly before finishing this task.

added support for multiple dimension continuous action spaces

Faster calculations in self_play.py
