ReinforcementLearning.jl
Fix for TicTacToeEnv allows illegal moves #1001
TicTacToeEnv currently allows the same agent to play the same state (an already occupied position) more than once. The condition added here should prevent this and throw an error if a state is played more than once.
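To illustrate the kind of check described above, here is a minimal sketch in Python. It is not the ReinforcementLearning.jl code: the class `TicTacToeSketch`, its fields, and the `act` and `legal_actions` methods are hypothetical names chosen for this example, and it assumes a flat 9-cell board where an illegal move is one aimed at an occupied cell.

```python
class TicTacToeSketch:
    """Toy 3x3 board (hypothetical; not the ReinforcementLearning.jl TicTacToeEnv)."""

    def __init__(self):
        self.board = [None] * 9      # None marks an empty cell
        self.current_player = "X"

    def legal_actions(self):
        # Only empty cells may be played.
        return [i for i, cell in enumerate(self.board) if cell is None]

    def act(self, action):
        if not 0 <= action < 9:
            raise ValueError(f"illegal move: cell {action} is off the board")
        # The guard in question: refuse a cell that has already been played.
        if self.board[action] is not None:
            raise ValueError(f"illegal move: cell {action} is already occupied")
        self.board[action] = self.current_player
        self.current_player = "O" if self.current_player == "X" else "X"
```

Raising before the board is mutated means a rejected move leaves both the board and the player to move unchanged, so the caller can recover and pick a legal action.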