What is IaGo?

How to play?

Install chainer
$ pip install chainer
Download this repository
$ git clone [email protected]:shionhonda/IaGo.git
Move to IaGo directory and execute game.py
$ python game.py
You can set following options:
--auto=False or --a=False
If this is set True, autoplay begins between SLPolicy and PV-MCTS, and if False (default), the game is played by you and PV-MCTS.
The thinking time is 10 seconds.
When placing a stone, input two numbers separated by comma. For example:
4,3
The first number corresponds to the vertical position and the second to the horizontal (one origin).

Download data from http://meipuru-344.hatenablog.com/entry/2017/11/27/205448
Save it as "IaGo/data/data.txt"
Augment data
$ python load.py
You need at least 32MB RAM to complete this step.
Execute train_policy.py to train SL policy network.
$ python train_policy.py --policy=sl --epoch=10 --gpu=0
You need GPUs to complete this step. It will take about 12 hours.
Execute train_policy.py to train rollout policy.
$ python train_policy.py --policy=rollout --epoch=1 --gpu=0
This is fast.
Execute train_rl.py to reinforce SL policy network with REINFORCE (a kind of policy gradients).
$ python train_rl.py --set=10000
Execute train_value.py to train value network.
$ python train_value.py --epoch=20 --gpu=0
Training done!

Special thanks to:
@Rochestar-NRT for replication of AlphaGo (especially MCTS).
@lazmond3 for giving lots of feedbacks!