ScalphaGoZero Why is the input to the NN structured differently from original alphaGo zero?

Why is the input to the NN structured differently from original alphaGo zero?

Open barrybecker4 opened this issue 5 years ago • 0 comments

I noticed that the way that the game state is sent to the input of the NN (see ZeroEncoder) is different from what is described here. In the cheat sheet, there are 14 layers that represent the positions of the black and white stones for the last seven moves (7 for black + 7 for white). In ScalphaGoZero, there are 8 layers that represent the stones with different numbers of liberties (4 for black stones with liberties 1, 2, 3, 4+ respectively, and same for white). Also, ScalphaGoZer has a layer for illegal ko moves that is not indicated by the cheat sheet.

I can see how both approaches might be useful. Which of these approaches gives better results? Which of these approaches matches the original Deepmind implementation more closely?

Apr 12 '19 13:04 barrybecker4

ScalphaGoZero ScalphaGoZero copied to clipboard

Why is the input to the NN structured differently from original alphaGo zero?

ScalphaGoZero
ScalphaGoZero copied to clipboard