ReinforcementLearning.jl issues

Question: Can ReinforcementLearning.jl handle Partially Observed Markov Processes (POMDPs)?

10

Hello, I was just wondering if the ReinforcementLearning.jl package can handle Partially Observed MDPs? I know that the POMDP.jl package can work with these, but the interface is very different....

00krishna

Recurrent Models

8

Does it this repo support recurrent models (LSTM for example)?

lorrp1

RLZoo

Gain in VPGPolicy does not account for terminal states?

3

I think we need to pass `traj[:terminal]` to `discount_rewards` so that the gain is computed only up to termination of an episode? https://github.com/JuliaReinforcementLearning/ReinforcementLearning.jl/blob/6fe6aa01208c325f8f990032621c18b61d574b37/src/ReinforcementLearningZoo/src/algorithms/policy_gradient/vpg.jl#L105

ArjunNarayanan

JuliaRL_BasicDQN_CartPole example fails

5

In the website https://juliareinforcementlearning.org/, there are the three lines ```julia julia> ] add ReinforcementLearningExperiments julia> using ReinforcementLearningExperiments julia> run(E`JuliaRL_BasicDQN_CartPole`) ``` However, the third one gives (on Julia v1.7.1) ```julia julia>...

blegat

help wanted

Refactor of DQN Algorithms

3

I've started refactoring the DQN implementations, but I'm fairly new to Julia so I'd appreciate your feedback about whether this is a good idea or not. In essence, it looks...

harwiltz

How to display/render AtariEnv?

3

Haven't found any docs about this. Can't figure how to use the following code in src/environments/3rd_party/atari.jl: ```julia imshowgrey(x::AbstractArray{UInt8,2}) = imshowgrey(reshape(x, :), size(x)) imshowgrey(x::AbstractArray{UInt8,1}, dims) = imshow(reshape(x, dims), colormap = 2)...

bhatiaabhinav

Rename some functions to help beginners navigate source code

3

I only realize this problem very recently. Multiple dispatch seems to be overused here in this package. For example, the `update!` function. I thought it was quite straightforward. When we...

findmyway

enhancement

P1

WIP: PETS algorithm from facebook/mbrl

PR Checklist - [ ] Update NEWS.md? Base implementation of [PETS](https://arxiv.org/abs/1805.12114) (see also [facebookresearch/mbrl-lib](https://github.com/facebookresearch/mbrl-lib)) that is currently not working. I might not have a lot of time for this in...

albheim

Support multiple discrete action space

A2C and PPO can be improved further to support mutiple discrete action space

findmyway

enhancement

good first issue

Support CircularVectorSARTTrajectory RLZoo

Most deep rl algorithms in RLZoo assume the state to be an array. However, states of a graph or any other data structure should also be supported out of the...

findmyway

enhancement

RLZoo

ReinforcementLearning.jl
ReinforcementLearning.jl copied to clipboard

Metadata

Question: Can ReinforcementLearning.jl handle Partially Observed Markov Processes (POMDPs)?

Recurrent Models

Gain in VPGPolicy does not account for terminal states?

JuliaRL_BasicDQN_CartPole example fails

Refactor of DQN Algorithms

How to display/render AtariEnv?

Rename some functions to help beginners navigate source code

WIP: PETS algorithm from facebook/mbrl

Support multiple discrete action space

Support CircularVectorSARTTrajectory RLZoo

← Metadata

Owner

Metadata

ReinforcementLearning.jl ReinforcementLearning.jl copied to clipboard

Metadata

← Metadata

Owner

Metadata

ReinforcementLearning.jl
ReinforcementLearning.jl copied to clipboard