stable-baselines3-contrib icon indicating copy to clipboard operation
stable-baselines3-contrib copied to clipboard

[Feature Request] STAC algorithm

Open EloyAnguiano opened this issue 1 year ago • 4 comments

🚀 Feature

Build the STAC algorithm as a callable algorithm: https://arxiv.org/pdf/2002.12928.pdf

Motivation

Hyperparametrization is one of the most time/cost expensive thing when training RL agents. May be this implementation saves some time/cost to some people and it could be the first AC algorithms that deals with meta-gradients to make improvements from here.

Pitch

I would like some to guide me of where to start or to give me some key insights of the posibilities of coding this.

Alternatives

The alternatives are that someone codes it by him/herself.

Additional context

No response

Checklist

  • [X] I have checked that there is no similar issue in the repo
  • [X] If I'm requesting a new feature, I have proposed alternatives

EloyAnguiano avatar Jan 04 '24 10:01 EloyAnguiano

Hello, are you willing to implement and benchmark the algorithm?

araffin avatar Jan 10 '24 12:01 araffin

Yes, I would like to try to do so. Is there any oficial benchmark to do so or some coding guides?

EloyAnguiano avatar Jan 12 '24 14:01 EloyAnguiano

The algorithm is an Off-policy one. Is there any way or example to begin with this kind of algorithms?

EloyAnguiano avatar Jan 12 '24 14:01 EloyAnguiano

The algorithm is an Off-policy one. Is there any way or example to begin with this kind of algorithms?

https://github.com/Stable-Baselines-Team/stable-baselines3-contrib/pull/4 and please read https://github.com/Stable-Baselines-Team/stable-baselines3-contrib/blob/master/CONTRIBUTING.md

araffin avatar Jan 12 '24 15:01 araffin