tensorflow-rl-tictactoe

Training a TensorFlow neural network to play Tic-Tac-Toe using the one-step Q-learning algorithm.
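
As an illustration of the one-step Q-learning update named above, here is a minimal tabular sketch in plain Python. It is not this repository's code: the board encoding, the function names, and the values of `GAMMA` and `ALPHA` are assumptions made for the example, and the project itself approximates Q with a TensorFlow network rather than updating stored values directly.

```python
# Hypothetical sketch of the one-step Q-learning update (not the repo's code).
GAMMA = 0.9  # discount factor (assumed value)
ALPHA = 0.5  # learning rate (assumed value)


def legal_moves(board):
    """Return indices of empty cells on a flat 9-cell board (0 = empty)."""
    return [i for i, cell in enumerate(board) if cell == 0]


def q_learning_target(reward, next_q_values, next_board, done):
    """One-step target: r + gamma * max over legal a' of Q(s', a').

    next_q_values holds one Q estimate per cell of next_board; estimates
    for occupied cells are ignored.
    """
    moves = legal_moves(next_board)
    if done or not moves:
        # Terminal state: there is no future value to bootstrap from.
        return reward
    return reward + GAMMA * max(next_q_values[m] for m in moves)


def q_update(q_value, target):
    """Move the current estimate Q(s, a) a step of size ALPHA toward the target."""
    return q_value + ALPHA * (target - q_value)


# Example: 1 and -1 mark the two players, 0 marks an empty cell.
next_board = [1, -1, 0, 0, 1, -1, 0, 0, 0]
next_q = [9.0, 9.0, 0.2, 0.5, 9.0, 9.0, 0.1, 0.0, -0.3]
target = q_learning_target(0.0, next_q, next_board, done=False)
new_q = q_update(0.0, target)
```

With a neural network, the same target would typically serve as the regression label for the network's output at the chosen action, in place of the direct assignment in `q_update`.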

Requirements:

  • TensorFlow (https://www.tensorflow.org/versions/r0.10/get_started/os_setup.html)
  • Colorama (pip install colorama)

References:

  • M. L. Littman. Markov games as a framework for multi-agent reinforcement learning. In Proceedings of the Eleventh International Conference on Machine Learning, pages 157–163, 1994.
  • W. T. Uther and M. Veloso. Adversarial reinforcement learning. Technical report, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, 1997.
  • R. A. C. Bianchi, C. H. C. Ribeiro, and A. H. R. Costa. Heuristic selection of actions in multiagent reinforcement learning. In IJCAI’07, Hyderabad, India, 2007.