How are move evaluations used? (Empty list in example)

Open • xtagon opened this issue 6 years ago • 1 comment

Hi,

I'm curious how move evaluations are meant to be used. In the counting game example, the evaluation function returns a vector containing () for each move, so the move evaluations carry no information and only the state evaluation is meaningful.

I can see in the code that there is an AlphaGo policy that uses the move evaluations, which makes sense. I want to check that I have this right: if I'm using UCTPolicy, should I return only a state evaluation and no move evaluations, and if I'm using AlphaGoPolicy, should I return both move evaluations and a state evaluation? Or does AlphaGoPolicy want only move evaluations and no state evaluation?
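
To make the question concrete, here is a minimal sketch of how I understand the two selection rules in general. This is not the library's code: the function names, parameters, and exact formulas are my own assumptions, based on the textbook UCT rule and the PUCT rule usually associated with AlphaGo. The point is only that the second rule takes a per-move prior, which is where I would expect a move evaluation to be used, while the first has no place for one.

```rust
/// Textbook UCT score for one child move. It depends only on visit
/// statistics, so there is no per-move evaluation to plug in.
/// (Hypothetical helper, not part of the library's API.)
fn uct_score(total_reward: f64, child_visits: u32, parent_visits: u32, c: f64) -> f64 {
    if child_visits == 0 {
        // Unvisited moves are explored before anything else.
        return f64::INFINITY;
    }
    let n = child_visits as f64;
    let mean = total_reward / n;
    mean + c * ((parent_visits as f64).ln() / n).sqrt()
}

/// PUCT-style score for one child move. The exploration term is scaled
/// by `prior`, a per-move value such as a policy-network probability.
/// (Hypothetical helper, not part of the library's API.)
fn puct_score(
    total_reward: f64,
    child_visits: u32,
    parent_visits: u32,
    prior: f64,
    c: f64,
) -> f64 {
    let n = child_visits as f64;
    // Treat the mean reward of an unvisited move as 0.
    let mean = if child_visits == 0 { 0.0 } else { total_reward / n };
    mean + c * prior * (parent_visits as f64).sqrt() / (1.0 + n)
}
```

In this sketch the mean reward in both rules would come from backed-up state evaluations, so I would guess that an AlphaGo-style policy needs both kinds of evaluation, but I'd like confirmation of how the library actually does it.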

Thanks!

Sep 06 '19 03:09 xtagon

Hello, did you ever get this sorted?

Thanks

Jul 16 '22 21:07 lewis-carson