Policy-Gradient-Methods
Policy-Gradient-Methods copied to clipboard

Published 20 hours ago •

cyoon1729

→

Metadata

Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC

Reame
Issues

Results 2 Policy-Gradient-Methods issues

Sort by recently updated

Is your DDPG implement missing adding noise?

I find you write: self.noise = OUNoise.... but you didn't add the noise to the action?

liuxiaotong15

Query on SAC2018.py file

Could you give reference to paper as to why you chose to make two soft-q networks because they are independently working and you are taking the minimum of both while...

sprakashdash

About

Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC

pytorch

reinforcement-learning

a2c

a3c

ddpg

soft-actor-critic

td3

pytorch-rl

policy-gradients

Stars

Forks

Watchers

Owner

cyoon1729

← Metadata

Stars

Forks

Watchers

Owner

cyoon1729

Metadata

Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC

Back

Policy-Gradient-Methods Policy-Gradient-Methods copied to clipboard

Metadata

Is your DDPG implement missing adding noise?

Query on SAC2018.py file

← Metadata

Owner

Metadata

Policy-Gradient-Methods
Policy-Gradient-Methods copied to clipboard