minimalRL icon indicating copy to clipboard operation
minimalRL copied to clipboard

Add new algorithms

Open rahulptel opened this issue 4 years ago • 7 comments

It would be nice to add the following algorithms:

  • [ ] RAINBOW
  • [x] A2C (multiprocessing)

I will submit a PR if I finish any of them.

rahulptel avatar Jul 16 '19 13:07 rahulptel

Hi! I think A2C (synchronous update version of A3C) is good. What about implementing RAINBOW rather than Double, Dueling DQN? I think the significance of the code to both Double and Dueling DQN is marginal because they are small variations of DQN in terms of implementation. In contrast, a simple implementation of the RAINBOW might be helpful for many people. (Actually, Dueling and Double DQN are 2 components of RAINBOW out of 6) https://arxiv.org/abs/1710.02298

seungeunrho avatar Jul 16 '19 16:07 seungeunrho

Agreed. We can go with RAINBOW.

rahulptel avatar Jul 16 '19 16:07 rahulptel

Awesome!

seungeunrho avatar Jul 17 '19 00:07 seungeunrho

MuZero would also be a cool algorithm, it is a bit more complicated with the MCTS but it works very well

BDEvan5 avatar Jun 11 '20 10:06 BDEvan5

Also, thanks so much for sharing. These are great simple implementations for learning and have been very useful.

If you want to try something else, you could also try to implement them in TensorFlow

BDEvan5 avatar Jun 11 '20 10:06 BDEvan5

How about SAC?

ADGEfficiency avatar Jul 30 '20 11:07 ADGEfficiency

How about Phasic Policy Gradient (PPG) as it gives better results than PPO? Also an example of using these algorithms for non gaming environment like ones with list, dict etc as observation instead of image frames. I guess that will be easy as we will have to use NN instead of CNN. Still a simple example, may be.

Mahesha999 avatar Apr 05 '21 21:04 Mahesha999