rlpyt issues

Working example code for R2D1

8

It seems like the code in experiments/ is outdated. I've managed to fix the import errors and get most of the code running; however, I get the following issue...

DavidMChan

Typing fix

1

Type comments are not prevalent in the codebase, but some type inference can be done automatically (e.g. using class definitions). This type comment is syntactically incorrect in that it causes...

jordan-schneider

Correct way to send AsyncReplayBuffers to new processes

9

I'm trying to do multi-GPU DistributedDataParallel training using an AsyncPrioritizedSequenceReplayFrameBuffer, and I'm having trouble passing a buffer created in the parent to child processes. When I directly pass the buffer...

MaxASchwarzer

Can custom environments return different namedtuples at different steps for the last element of the tuple returned by Env.step?

4

Hi Alex, the question is in the title. Basically, I have an environment where I need to record a variable number of additional pieces of information about the environment, and on...

bpiv400

log_std exploding in GaussianPgAgent (MujocoFfAgent)

2

Hello, thanks for a great library! I want to apply the PPO implementation to my own environment. I am using MujocoFfAgent and encountered an error that I cannot fix. Maybe you can...

kzorina

Pretrained models

11

Is there any chance you could release the pretrained models for the implemented algorithms? It would accelerate research for some, and help those without access to good hardware.

juliusfrost

Windows Requires Pytorch >= 1.3

3

Hi Adam, I wanted to let you know that on Windows, you need PyTorch 1.3 or later to use your implementation of PPO. When using PyTorch 1.2, calls to `PPO.optimize_agent`...

bpiv400

Support for weird graph-based observation data type

6

Hello, I would like to use your framework in my research for its multithreading features, but I have a bit of a weird MDP. The state is a pytorch_geometric graph-structured...

tarungog

Unique distribution class

2

Is there a reason why the base distribution class doesn't inherit from PyTorch's Distribution class? It would be a lot more convenient if I could rely on the two being...

jordan-schneider

Batches with additional observations

4

Apologies if this has already been answered before, but I am trying to sample batches of the form (s_t, a_t, r_t, s_{t+1}, s_{t+2}, ..., s_{t+K}), where K is a parameter. How would one go about doing...

bmazoure
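
For readers unsure what the batch shape in "Batches with additional observations" means, here is a minimal NumPy sketch. It is not rlpyt's replay buffer API: the function name `sample_k_step_batch`, the flat `[T, ...]` array layout, and the convention that `done[t]` marks the last step of an episode are all assumptions made for illustration. It draws start indices t whose K-step window stays inside one episode and gathers the K following observations.

```python
import numpy as np


def sample_k_step_batch(obs, act, rew, done, batch_size, K, rng):
    """Sample (s_t, a_t, r_t, [s_{t+1}, ..., s_{t+K}]) from flat arrays.

    Hypothetical layout (an assumption, not rlpyt's): obs, act, rew and done
    all have leading dimension T, and done[t] is True when step t ends an
    episode, so obs[t + 1] belongs to the next episode.
    """
    T = len(obs)
    if K < 1 or K >= T:
        raise ValueError("K must satisfy 1 <= K < T")

    # Candidate starts: t + K must still be a valid index into obs.
    starts = np.arange(T - K)

    # cum[i] counts the done flags in done[:i], so cum[t + K] - cum[t]
    # is the number of episode ends inside the window done[t : t + K].
    cum = np.concatenate([[0], np.cumsum(done.astype(np.int64))])
    crosses_boundary = (cum[starts + K] - cum[starts]) > 0
    valid = starts[~crosses_boundary]
    if len(valid) == 0:
        raise ValueError("no window of length K fits inside a single episode")

    # Sample start indices (with replacement) and gather the future frames.
    t = rng.choice(valid, size=batch_size)
    offsets = np.arange(1, K + 1)
    future_obs = obs[t[:, None] + offsets[None, :]]  # shape [B, K, ...]
    return obs[t], act[t], rew[t], future_obs
```

With a prioritized or frame-based buffer the index bookkeeping would differ, so treat this only as a description of the target batch layout.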
