pytorch-REINFORCE

PyTorch Implementation of REINFORCE for both discrete & continuous control

5 pytorch-REINFORCE issues

The policy gradient here seems to differ from the policy gradient given in most places, e.g., [Berkeley CS285](http://rail.eecs.berkeley.edu/deeprlcourse/static/slides/lec-5.pdf). Could the author point to the source the algorithm was taken from?
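For comparison, here is a minimal sketch of the textbook REINFORCE objective that course material usually presents: each action's log-probability is weighted by the discounted return from that step onward. It is not taken from this repository, and the function names `discounted_returns` and `reinforce_surrogate_loss` are hypothetical. Plain floats stand in for tensors so the arithmetic is easy to follow; in PyTorch the log-probabilities would be tensors and the loss would be backpropagated.

```python
from typing import List


def discounted_returns(rewards: List[float], gamma: float = 0.99) -> List[float]:
    """Return G_t = r_t + gamma * r_{t+1} + gamma^2 * r_{t+2} + ... for every step t."""
    returns = []
    running = 0.0
    # Walk the episode backwards so each return reuses the one after it.
    for r in reversed(rewards):
        running = r + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


def reinforce_surrogate_loss(
    log_probs: List[float], rewards: List[float], gamma: float = 0.99
) -> float:
    """Textbook surrogate loss: -sum_t log pi(a_t | s_t) * G_t (no baseline)."""
    returns = discounted_returns(rewards, gamma)
    return -sum(lp * g for lp, g in zip(log_probs, returns))
```

Minimizing this quantity with respect to the policy parameters follows the standard Monte Carlo policy gradient estimate. Implementations often add a baseline, return normalization, or an entropy term, which is one way a repository's loss can look different from the slides.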

Hi, thank you for the sample code. I could not understand what exactly is happening here: https://github.com/JamesChuanggg/pytorch-REINFORCE/blob/master/reinforce_discrete.py#L52. If possible, could you please give a brief explanation? Thanks.

Hi. First of all, thanks for the good, clear code! My problem: I am trying to run this continuous-control algorithm "as is" on the simplest Gym environments, such as MountainCarContinuous-v0, Pendulum-v0...