pytorch-REINFORCE issues

Results 5 pytorch-REINFORCE issues

Can you give me the paper about the continuous case?

Thank you.

Inconsistent policy gradient

The policy gradient here seems to differ from the one used in most places, e.g., [Berkeley CS285](http://rail.eecs.berkeley.edu/deeprlcourse/static/slides/lec-5.pdf). Can the author cite the source of the algorithm implemented here?

ZikangXiong
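
For readers comparing against the textbook form, here is a minimal sketch of the usual REINFORCE surrogate loss (the reward-to-go variant), written in plain Python so it runs without any dependencies. It is not taken from this repository: the function names `discounted_returns` and `reinforce_loss` and the default `gamma` are assumptions made for illustration. In PyTorch the log-probabilities would be tensors and the same expression would be differentiated by autograd.

```python
def discounted_returns(rewards, gamma=0.99):
    """Reward-to-go for each step: G_t = r_t + gamma * G_{t+1}.

    Hypothetical helper for illustration, not part of the repository.
    """
    returns = []
    running = 0.0
    # Walk the episode backwards so each step reuses the next step's return.
    for r in reversed(rewards):
        running = r + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


def reinforce_loss(log_probs, returns):
    """Surrogate loss: -sum_t log pi(a_t | s_t) * G_t.

    Minimizing this loss follows the standard policy-gradient estimate.
    """
    return -sum(lp * g for lp, g in zip(log_probs, returns))


# Example: a three-step episode with reward 1 at every step.
example_returns = discounted_returns([1.0, 1.0, 1.0], gamma=0.5)
example_loss = reinforce_loss([-1.0, -2.0], [2.0, 1.0])
```

Comparing a repository's loss against this form term by term (sign, which return multiplies which log-probability, any extra entropy or normalization terms) is one way to pin down where an implementation departs from the lecture slides.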

Please add some explanation

Hi, thank you for the sample code. I could not understand what exactly is happening here: https://github.com/JamesChuanggg/pytorch-REINFORCE/blob/master/reinforce_discrete.py#L52 If possible, could you please give a short explanation? Thanks.

parajain

continuous-control doesn't work for MountainCarContinuous-v0

Hi. First of all, thanks for the good, clear code! My problem: I am trying to run this continuous-control algorithm "as is" on the simplest gym environments, such as MountainCarContinuous-v0, Pendulum-v0...

Belerafon

normalized_actions.py line16 actions -> action

bug

Ian-Sy-Zhang
