ReinforcementLearning.jl

A reinforcement learning package for Julia

Results 117 ReinforcementLearning.jl issues

Hello, I was just wondering whether the ReinforcementLearning.jl package can handle partially observable MDPs (POMDPs). I know that the POMDPs.jl package can work with these, but its interface is very different...

Does this repo support recurrent models (an LSTM, for example)?

RLZoo

I think we need to pass `traj[:terminal]` to `discount_rewards` so that the gain is computed only up to the termination of each episode. See https://github.com/JuliaReinforcementLearning/ReinforcementLearning.jl/blob/6fe6aa01208c325f8f990032621c18b61d574b37/src/ReinforcementLearningZoo/src/algorithms/policy_gradient/vpg.jl#L105
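The concern above, that the discounted gain should not leak across episode boundaries, can be illustrated with a small standalone sketch. The function name `discounted_returns` and its signature are hypothetical and are not the package's `discount_rewards`; the sketch assumes that a `true` entry in the terminal vector marks the last step of an episode.

```julia
# Hypothetical helper for illustration only; NOT the package's `discount_rewards`.
# Assumption: terminals[t] == true means step t is the last step of its episode.
function discounted_returns(rewards::AbstractVector{<:Real},
                            terminals::AbstractVector{Bool},
                            γ::Real)
    length(rewards) == length(terminals) ||
        throw(ArgumentError("rewards and terminals must have the same length"))
    returns = zeros(Float64, length(rewards))
    running = 0.0
    # Walk backwards so each return can reuse the one computed for the next step.
    for t in length(rewards):-1:1
        # At an episode boundary, rewards from later steps must not contribute.
        if terminals[t]
            running = 0.0
        end
        running = rewards[t] + γ * running
        returns[t] = running
    end
    return returns
end

rewards   = [1.0, 1.0, 1.0, 1.0, 1.0]
terminals = [false, false, true, false, true]
G = discounted_returns(rewards, terminals, 0.5)
# G == [1.75, 1.5, 1.0, 1.5, 1.0]
```

Without the terminal flags, the return at step 3 would also include the discounted rewards of steps 4 and 5, which belong to a different episode.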

On the website https://juliareinforcementlearning.org/, the following three lines are given:

```julia
julia> ] add ReinforcementLearningExperiments
julia> using ReinforcementLearningExperiments
julia> run(E`JuliaRL_BasicDQN_CartPole`)
```

However, the third one gives (on Julia v1.7.1) ```julia julia>...

help wanted

I've started refactoring the DQN implementations, but I'm fairly new to Julia so I'd appreciate your feedback about whether this is a good idea or not. In essence, it looks...

I haven't found any docs about this, and I can't figure out how to use the following code in `src/environments/3rd_party/atari.jl`: ```julia imshowgrey(x::AbstractArray{UInt8,2}) = imshowgrey(reshape(x, :), size(x)) imshowgrey(x::AbstractArray{UInt8,1}, dims) = imshow(reshape(x, dims), colormap = 2)...

I only realized this problem very recently. Multiple dispatch seems to be overused in this package, for example in the `update!` function. I thought it was quite straightforward. When we...

enhancement
P1

PR Checklist
- [ ] Update NEWS.md?

Base implementation of [PETS](https://arxiv.org/abs/1805.12114) (see also [facebookresearch/mbrl-lib](https://github.com/facebookresearch/mbrl-lib)) that is currently not working. I might not have a lot of time for this in...

A2C and PPO can be improved further to support multiple discrete action spaces.
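One common way to handle several discrete action dimensions in a policy-gradient method is to give the policy one categorical head per dimension and sum the per-head log-probabilities. The sketch below assumes that independent-heads factorization, which the issue itself does not specify; `log_softmax` and `joint_logprob` are hypothetical names defined here and are not part of the package.

```julia
# Numerically stable log-softmax over one vector of logits.
function log_softmax(logits::AbstractVector{<:Real})
    shifted = logits .- maximum(logits)
    return shifted .- log(sum(exp.(shifted)))
end

# Joint log-probability of a multi-discrete action.
# Assumption: each action dimension has its own independent categorical head,
# so the joint log-probability is the sum over heads.
# `action[i]` is a 1-based index into `head_logits[i]`.
function joint_logprob(head_logits::AbstractVector, action::AbstractVector{<:Integer})
    length(head_logits) == length(action) ||
        throw(ArgumentError("need exactly one action index per head"))
    return sum(log_softmax(l)[a] for (l, a) in zip(head_logits, action))
end

# Two heads: one with 2 choices, one with 4 choices, both with uniform logits.
head_logits = [[0.0, 0.0], [0.0, 0.0, 0.0, 0.0]]
lp = joint_logprob(head_logits, [1, 3])
```

With uniform logits the joint probability is 1/2 × 1/4, so `lp` is approximately `-log(8)`. This summed log-probability is the quantity that would enter the A2C or PPO loss in place of the single-head log-probability.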

enhancement
good first issue

Most deep RL algorithms in RLZoo assume the state to be an array. However, states represented as a graph or any other data structure should also be supported out of the...

enhancement
RLZoo