minimalRL Add new algorithms

Add new algorithms

Open rahulptel opened this issue 4 years ago • 7 comments

It would be nice to add the following algorithms:

[ ] RAINBOW
[x] A2C (multiprocessing)

I will submit a PR if I finish any of them.

Jul 16 '19 13:07 rahulptel

Hi! I think A2C (synchronous update version of A3C) is good. What about implementing RAINBOW rather than Double, Dueling DQN? I think the significance of the code to both Double and Dueling DQN is marginal because they are small variations of DQN in terms of implementation. In contrast, a simple implementation of the RAINBOW might be helpful for many people. (Actually, Dueling and Double DQN are 2 components of RAINBOW out of 6) https://arxiv.org/abs/1710.02298

Jul 16 '19 16:07 seungeunrho

Agreed. We can go with RAINBOW.

Jul 16 '19 16:07 rahulptel

Awesome!

Jul 17 '19 00:07 seungeunrho

MuZero would also be a cool algorithm, it is a bit more complicated with the MCTS but it works very well

Jun 11 '20 10:06 BDEvan5

Also, thanks so much for sharing. These are great simple implementations for learning and have been very useful.

If you want to try something else, you could also try to implement them in TensorFlow

Jun 11 '20 10:06 BDEvan5

How about SAC?

Jul 30 '20 11:07 ADGEfficiency

How about Phasic Policy Gradient (PPG) as it gives better results than PPO? Also an example of using these algorithms for non gaming environment like ones with list, dict etc as observation instead of image frames. I guess that will be easy as we will have to use NN instead of CNN. Still a simple example, may be.

Apr 05 '21 21:04 Mahesha999

minimalRL minimalRL copied to clipboard

Add new algorithms

minimalRL
minimalRL copied to clipboard