Deep-Reinforcement-Learning-Algorithms-with-PyTorch icon indicating copy to clipboard operation
Deep-Reinforcement-Learning-Algorithms-with-PyTorch copied to clipboard

Add A2C algorithm

Open p-christ opened this issue 6 years ago • 2 comments
trafficstars

Add the A2C algorithm which is the synchronous version of the algorithm described in this paper https://arxiv.org/pdf/1602.01783.pdf and described here: https://medium.com/emergent-future/simple-reinforcement-learning-with-tensorflow-part-8-asynchronous-actor-critic-agents-a3c-c88f72a5e9f2

p-christ avatar Jan 14 '19 08:01 p-christ

Is this still being worked on? I am getting this error below from running the cartpole test.

As you can see, the hyperparameters dict has 2 parent keys: Actor, Critic. The linear_hidden_units key is embedded in each of those, thus getting a key error. Do you have working code for this? It seems that the nn_builder needs to do some complex neural net building for handling the actor and critic. I feel that instead maybe it would be better to keep them as separate neural nets?

image

josiahls avatar May 15 '19 23:05 josiahls

hi, yep this is still being worked on, thanks for pointing out this error.

I've updated it now and it works again for me (although it is a bit more messy now), does it work for you? I have them as one network because it means they get to share the knowledge gathered in the lower layers of the network and so it can be more efficient

p-christ avatar May 16 '19 07:05 p-christ