reinforcement-learning-algorithms icon indicating copy to clipboard operation
reinforcement-learning-algorithms copied to clipboard

This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, TRPO. (More algorithms are st...

Results 6 reinforcement-learning-algorithms issues
Sort by recently updated
recently updated
newest added

I run the SAC code with torch (compiled version) while i encounter the error ```python RuntimeError: one of the variables needed for gradient computation has been modified by an inplace...

I try to use "MountainCar-v0" env in sac agent but still wrong with "tuple index out of range" Could you tell me how to fix it ? Thanks Traceback (most...

Hi, I am oscar and I do appreciate those source codes with integrating various algorithms. I have tried to run the nature DQN with default setting through **Pong** and **BeamRider**...

Could you please tell me how to use the results? ![image](https://user-images.githubusercontent.com/42888230/103749942-1e841100-5041-11eb-9567-879b93887c5a.png) I want the data just like: mean_reward & time_step

Thanks for the excellent implementations of multiple classic RL agents, I have tried some of them, worked very well. Just curious, do you plan to add the prioritized experience replay...

How to visualize reward-epoch? Does It need extra code for post-processing(post-process print infomation) or is there some tools for visualization? I cound not find the extra code in github project.