ReinforcementLearning.jl

A reinforcement learning package for Julia

Results: 117 ReinforcementLearning.jl issues

This is a fix, with an associated unit test, for the issue "Wrong style for state report for TicTacToeEnv() #1079".

A call to `state(env::TicTacToeEnv, Observation{BitArray{3}}())` does not result in the correct style. The error can be "seen" using:

```julia
env = TicTacToeEnv()
display(state(env, Observation{String}(), current_player(env))) # works as expected
display(state(env,...
```

The two functions `reward(::TicTacToeEnv, ::Player)` and `is_terminated(::TicTacToeEnv)` result in a small but needless allocation due to a type instability in the call to `get_tic_tac_toe_state_info()`. To see this, you can use:

```julia
using ReinforcementLearning...
```

First of all, I would like to say thank you to all of the contributors of this useful package! I am a learner of both RL and this package. I...

TicTacToeEnv allows one state to be played multiple times by the same agent. This condition should prevent this behavior and throw an error in case the state is played multiple...

PR Checklist
- [ ] Update NEWS.md?
- [ ] Unit tests for all structs / functions?
- [ ] Integration and correctness tests using a simple env?
- [...

I am running into limitations of the current design of the `run` loop: Let's assume I am using a custom policy that internally stores the history of past observations and...