muzero-general
muzero-general copied to clipboard
MuZero
### Search before asking - [X] I have searched the MuZero [issues](https://github.com/werner-duvaud/muzero-general/issues) and found no similar bug report. ### 🐛 Describe the bug I noticed that dirichlet noise is added...
This is a work-in-progress implementation of [Sampled Muzero ](https://arxiv.org/pdf/2104.06303.pdf) that I've been working on. I figured I'd store the implementation here in case anyone else is interested in developing it....
Hi, Can I please ask that what does Mean_value plot and also its significance in reinforcement learning (specific to this algorithm). I tried to understand this from the code but...
### Search before asking - [X] I have searched the MuZero [issues](https://github.com/werner-duvaud/muzero-general/issues) and found no similar bug report. ### 🐛 Describe the bug ![Screen Shot 2022-11-30 at 4 31 51...
### Search before asking - [X] I have searched the MuZero [issues](https://github.com/werner-duvaud/muzero-general/issues) and found no similar feature requests. ### Description Hi, I will appreciate if someone would suggest me that...
Thanks for you open-sourced code very much. I am very confused about this code segment in [backpropagate](https://github.com/werner-duvaud/muzero-general/blob/master/self_play.py#L406) method in self_play.py: when len(self.config.players) is 2, - in line [423](https://github.com/werner-duvaud/muzero-general/blob/master/self_play.py#L423): `min_max_stats.update(node.reward +...
### Search before asking - [X] I have searched the MuZero [issues](https://github.com/werner-duvaud/muzero-general/issues) and found no similar feature requests. ### Description Perfect ideas and codes In my game, there is only...
### Search before asking - [X] I have searched the MuZero [issues](https://github.com/werner-duvaud/muzero-general/issues) and found no similar bug report. ### 🐛 Describe the bug I run the script in both 3.7...