rlpyt
rlpyt copied to clipboard
Reinforcement Learning in PyTorch
It seems like the code in experiments/ is outdated - I've managed to update the import errors and get most of the code running, however I get the following issue...
Type comments are not prevalent in the codebase, but some type inference can be done automatically (e.g. using class definitions). This type comment is syntactically incorrect in that it causes...
I'm trying to do multi-GPU DistributedDataParallel training using an AsyncPrioritizedSequenceReplayFrameBuffer, and I'm having trouble passing a buffer created in the parent to child processes. When I directly pass the buffer...
Hi Alex, Question is in the title. Basically, I have an environment where I need to record a variable number of additional pieces of information about the environment, and on...
Hello Thanks for a great library! I want to apply PPO implementation to my own environment. I am using MujocoFfAgent and encountered error that I cannot fix. Maybe you can...
Is there any chance you could release the pretrained models for the implemented algorithms? It would accelerate research for some, and help those without access to good hardware.
Hi Adam, I wanted to let you know that on Windows, you need Pytorch 1.3 or later to use your implementation of PPO. When using Pytorch 1.2, calls to `PPO.optimize_agent`...
Hello, I would like to use your framework in my research for its multithreading features but I have a bit of a weird MDP. the state is a pytorch_geometric graph-structured...
Is there a reason why the base distribution class doesn't inherit from pytorch's Distribution class? It would be a lot more convenient if I could rely on the two being...
Apologies if this has already been answered before, but I am trying to samples batches of the form (s_t,a_t,r_t,s_t+1,s_t+2,..,s_t+K) where K is a parameter. How would one go about doing...