Majid Laali
Majid Laali
@vmoens thanks for providing an implementation of MCTS. I am wondering if you would plan to merge this PR?
That makes sense, let me see if I can grab from your PR and add MCTS implementation. Do you have any comment / suggestion for me on completing this PR?
@vmoens thanks for explanation, I was a bit slow in implementation as I am bit busy with a few other things. I like the idea of using env to encapsulate...
Does this help? https://github.com/google-deepmind/open_spiel/blob/master/open_spiel/python/examples/tic_tac_toe_qlearner.py It shows how to roll out and train an agent.