sheeprl icon indicating copy to clipboard operation
sheeprl copied to clipboard

Algorithm Request: more DQN-based approaches

Open samlobel opened this issue 1 year ago • 2 comments

I am thinking of using sheeprl as the base for my RL experiments! My work usually builds off of DQN-type algorithms: in increasing level of complexity, off of DDQN, Rainbow, or R2D2. Having some of these implemented would make this library much more convenient for research IMO.

It's super cool that you have Dreamer and Plan2Explore implemented, but that's not a great starting point for RL research because of how complex and opinionated they are. It would be great to just have a simple DDQN baseline implemented! All the PPO and A2C stuff is great if you work in the online setting, but DQN type things would let us build things for batch training. In a perfect world it'd also have an implementation of something like R2D2 since that's a good SOTA-ish DQN upgrade.

Excited to start trying it out!

samlobel avatar Aug 07 '23 18:08 samlobel

Hi @samlobel and thank you for using sheeprl for your experiments! If you want we can start with a standard DDQN agent: you can have a look at our PPO implementations from this branch where we have the possibility to encode both images and vectors. We can start after the mentioned branch is merged, hopefully this week

belerico avatar Aug 08 '23 09:08 belerico

@samlobel you can now try out directly the main branch

belerico avatar Aug 09 '23 13:08 belerico