tensorflow-rl-tictactoe
tensorflow-rl-tictactoe copied to clipboard

→

Metadata

Training TensorFlow neural network to play Tic-Tac-Toe game using one-step Q-learning algorithm.

Readme
Issues

Training TensorFlow neural network to play Tic-Tac-Toe game using one-step Q-learning algorithm.

Requirements:

TensorFlow (https://www.tensorflow.org/versions/r0.10/get_started/os_setup.html)
Colorama (pip install colorama)

References:

Michael L. Littman. Markov games as a framework for multi-agent reinforcement learning. Machine Learning, 11:157–163, 1994.
W. T. Uther and M. Veloso. Adversarial reinforcement learning, School Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, 1997.
R. A. C. Bianchi, C. H. C. Ribeiro, and A. H. R. Costa. Heuristic selection of actions in multiagent reinforcement learning. In IJCAI’07, Hyderabad, India, 2007.

About

Training TensorFlow neural network to play Tic-Tac-Toe game using one-step Q-learning algorithm.

33

Stars

16

Forks

Watchers

Owner

← Metadata

33

Stars

16

Forks

Watchers

Owner

Metadata

Training TensorFlow neural network to play Tic-Tac-Toe game using one-step Q-learning algorithm.