rlpyt icon indicating copy to clipboard operation
rlpyt copied to clipboard

Reinforcement Learning in PyTorch

Results 64 rlpyt issues
Sort by recently updated
recently updated
newest added

Hi. I am new to rlpyt and I ran the dqn_async_gpu example. I looked at the logs at /data/local and I was trying to find the episode rewards. I suspect...

Since HER applies to any off policy algorithm, I think this would be useful for researchers studying sparse reward problems. I can take a crack at this and submit a...

My program fails with the following error when I try to use GpuSampler or CpuSampler (SerialSampler works normally) XIO: fatal IO error 11 (Resource temporarily unavailable) on X server "localhost:11.0"...

I'm trying to run the examples as a sanity check but they don't seem to be learning. I tried examples 1 and 3. The only change I made to example...

now I use the multi-thread. and when my training code is terminated, but it does not kill pid. how can solve this?

Hi, In the [rlpyt/rlpyt/ul](https://github.com/astooke/rlpyt/tree/master/rlpyt/ul) directory, description says: "See "experiments" folder for scripts to run online RL agents with ATC ... and **other RL agents to gather expert demonstrations**. However, I...

Hi! Thanks for your careful description of the library and I meet a question when reading it. When I think about the relation between the CategoricalPgAgent and the MujocoFfModel, I...

Dear @astooke Our team is trying to reproduce the decoupling results for Atari games from [paper](https://arxiv.org/abs/2009.08319). More specifically, we are trying to reproduce the results from figure 4. To achieve...

Hi, I'm trying to run DQN with asynchronous sampling using rlpyt's async sampler and runner classes. However, it looks like they don't work with CPU only, and require the presence...

In, e.g., PPO (though this also applies to at least A2C; I didn't check any others), when OptInfo is being populated, `.item` is called on most of the fields but...