Deep-Reinforcement-Learning-Algorithms-with-PyTorch Add A2C algorithm

Add A2C algorithm

Open p-christ opened this issue 6 years ago • 2 comments

trafficstars

Add the A2C algorithm which is the synchronous version of the algorithm described in this paper https://arxiv.org/pdf/1602.01783.pdf and described here: https://medium.com/emergent-future/simple-reinforcement-learning-with-tensorflow-part-8-asynchronous-actor-critic-agents-a3c-c88f72a5e9f2

Jan 14 '19 08:01 p-christ

Is this still being worked on? I am getting this error below from running the cartpole test.

As you can see, the hyperparameters dict has 2 parent keys: Actor, Critic. The linear_hidden_units key is embedded in each of those, thus getting a key error. Do you have working code for this? It seems that the nn_builder needs to do some complex neural net building for handling the actor and critic. I feel that instead maybe it would be better to keep them as separate neural nets?

May 15 '19 23:05 josiahls

hi, yep this is still being worked on, thanks for pointing out this error.

I've updated it now and it works again for me (although it is a bit more messy now), does it work for you? I have them as one network because it means they get to share the knowledge gathered in the lower layers of the network and so it can be more efficient

May 16 '19 07:05 p-christ

Deep-Reinforcement-Learning-Algorithms-with-PyTorch Deep-Reinforcement-Learning-Algorithms-with-PyTorch copied to clipboard

Add A2C algorithm

Deep-Reinforcement-Learning-Algorithms-with-PyTorch
Deep-Reinforcement-Learning-Algorithms-with-PyTorch copied to clipboard