reinforcement-learning-algorithms issues

Bug using SAC with torch version 1.8.0a0+963f762

1

I run the SAC code with torch (compiled version) while i encounter the error ```python RuntimeError: one of the variables needed for gradient computation has been modified by an inplace...

dmksjfl

SAC Agent still wrong with "tuple index out of range" use "MountainCar-v0"

1

I try to use "MountainCar-v0" env in sac agent but still wrong with "tuple index out of range" Could you tell me how to fix it ? Thanks Traceback (most...

StewartTsai

Plotted Reward Scale

3

Hi, I am oscar and I do appreciate those source codes with integrating various algorithms. I have tried to run the nature DQN with default setting through **Pong** and **BeamRider**...

OscarHuangWind

Could you please tell me how to use the results? ![image](https://user-images.githubusercontent.com/42888230/103749942-1e841100-5041-11eb-9567-879b93887c5a.png) I want the data just like: mean_reward & time_step

THSWind

Add prioritized experience replay

5

Thanks for the excellent implementations of multiple classic RL agents, I have tried some of them, worked very well. Just curious, do you plan to add the prioritized experience replay...

JasAva

How to visualize reward-epoch?

2

How to visualize reward-epoch? Does It need extra code for post-processing(post-process print infomation) or is there some tools for visualization? I cound not find the extra code in github project.

ShaoyuanLi

reinforcement-learning-algorithms
reinforcement-learning-algorithms copied to clipboard

Metadata

Bug using SAC with torch version 1.8.0a0+963f762