mcts
How are move evaluations used? (Empty list in example)
Hi,
I was just curious how move evaluations are meant to be used. In the counting game example, the evaluator returns a vector containing () for each move as the move evaluations, so the only meaningful value it returns is the state evaluation.
I can see in the code that there is an AlphaGo policy that uses the move evaluations (that makes sense). So I just wanted to check to see if I got this right: If I'm using the UCTPolicy, I should return only a state evaluation, and not move evaluations, and if I'm using the AlphaGoPolicy, I should return move evaluations and a state evaluation? Or does the AlphaGoPolicy want only move evaluations and not a state evaluation?
Thanks!
Hello, did you ever get this sorted?
Thanks
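For anyone else landing here with the same question, here is a minimal sketch of how the two selection rules usually differ in the literature. It is not taken from this crate's source, and the function names and parameters (uct_score, puct_score, prior, c) are made up for illustration. The point it shows: plain UCT has no per-move term, so an empty move evaluation is enough, while an AlphaGo-style (PUCT) rule scales its exploration bonus by a per-move prior. Both rules still rely on the state evaluation, since that is what gets backed up into total_reward.

```rust
/// Plain UCT score for one child: average reward plus an exploration bonus.
/// No per-move prior appears, so the move evaluation can be empty (`()`).
/// (Hypothetical helper for illustration, not the crate's API.)
fn uct_score(total_reward: f64, child_visits: u32, parent_visits: u32, c: f64) -> f64 {
    if child_visits == 0 {
        // Unvisited moves are tried first.
        return f64::INFINITY;
    }
    let n = child_visits as f64;
    total_reward / n + c * ((parent_visits as f64).ln() / n).sqrt()
}

/// AlphaGo-style PUCT score: the exploration bonus is scaled by `prior`,
/// a per-move probability that a move evaluation would supply.
/// (Hypothetical helper for illustration, not the crate's API.)
fn puct_score(
    total_reward: f64,
    child_visits: u32,
    parent_visits: u32,
    prior: f64,
    c: f64,
) -> f64 {
    let n = child_visits as f64;
    // Average backed-up state evaluation; 0.0 for a move never visited.
    let q = if child_visits == 0 { 0.0 } else { total_reward / n };
    q + c * prior * (parent_visits as f64).sqrt() / (1.0 + n)
}
```

Under that reading, a UCT-style policy needs only a state evaluation, and an AlphaGo-style policy needs both the move evaluations (as priors) and a state evaluation. Whether this crate's UCTPolicy and AlphaGoPolicy follow these exact formulas is an assumption worth checking against the source.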