alpha-zero-general Generalize MCTS for games with reward on each game step

Generalize MCTS for games with reward on each game step

Open cfytrok opened this issue 3 years ago • 1 comment

What are the difficulties in using the algorithm for games that return a reward on each step? And for infinite games? It seems that you only need to change the MCTS algorithm slightly: take the reward into account when calculating Q, and have the game's getNextState function additionally return the reward.
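
A rough sketch of the proposed change, to make the idea concrete. This is not code from the repository: `CountdownGame`, `RewardMCTS`, `get_next_state`, `is_terminal`, `gamma` and `max_depth` are hypothetical names chosen for illustration. It assumes a single-player game (so there is no sign flip between players), a uniform prior, and a leaf value of 0 where a network's value estimate would normally go. The reward enters Q through the backed-up return `reward + gamma * value_of_next_state`, and the discount plus a depth cap is one possible way to keep the search bounded for games that never end.

```python
import math
from collections import defaultdict


class CountdownGame:
    """Hypothetical toy game: the state is the number of steps left.

    Action 1 pays a reward of 1.0, action 0 pays 0.0; the game ends at state 0.
    """

    def get_action_size(self):
        return 2

    def get_next_state(self, state, action):
        # Returns the reward together with the next state (the proposed change).
        return state - 1, float(action)

    def is_terminal(self, state):
        return state == 0


class RewardMCTS:
    """MCTS sketch whose Q values average discounted per-step returns."""

    def __init__(self, game, gamma=0.9, cpuct=1.0, max_depth=50):
        self.game = game
        self.gamma = gamma          # discount factor (assumption, bounds long games)
        self.cpuct = cpuct          # exploration constant
        self.max_depth = max_depth  # depth cap (assumption, for endless games)
        self.Qsa = defaultdict(float)  # mean return for (state, action)
        self.Nsa = defaultdict(int)    # visit count for (state, action)
        self.Ns = defaultdict(int)     # visit count for state
        self.expanded = set()

    def search(self, state, depth=0):
        """Run one simulation and return the discounted return from `state`."""
        if self.game.is_terminal(state) or depth >= self.max_depth:
            return 0.0
        if state not in self.expanded:
            self.expanded.add(state)
            return 0.0  # placeholder for a learned value estimate

        n_actions = self.game.get_action_size()
        prior = 1.0 / n_actions  # uniform prior stands in for a policy network
        best_action, best_ucb = 0, -math.inf
        for a in range(n_actions):
            ucb = self.Qsa[(state, a)] + self.cpuct * prior * math.sqrt(
                self.Ns[state] + 1e-8
            ) / (1 + self.Nsa[(state, a)])
            if ucb > best_ucb:
                best_action, best_ucb = a, ucb

        next_state, reward = self.game.get_next_state(state, best_action)
        # The step reward is added to the discounted value of the successor.
        ret = reward + self.gamma * self.search(next_state, depth + 1)

        key = (state, best_action)
        self.Qsa[key] = (self.Nsa[key] * self.Qsa[key] + ret) / (self.Nsa[key] + 1)
        self.Nsa[key] += 1
        self.Ns[state] += 1
        return ret


mcts = RewardMCTS(CountdownGame(), gamma=0.9)
for _ in range(200):
    mcts.search(3)
```

One thing to watch: Q is no longer confined to [-1, 1] as it is with win/loss outcomes, so `cpuct` (or a normalization of Q) would probably need retuning for the exploration term to stay meaningful.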

Sep 21 '20 18:09 cfytrok
