stable-baselines3-contrib icon indicating copy to clipboard operation
stable-baselines3-contrib copied to clipboard

[Question] How to test recurrent + maskable + dependent multidiscrete actions?

Open maxmax1992 opened this issue 11 months ago • 1 comments

❓ Question

Hey I plan to contribute to the repo, but not sure what env to choose as POC approval of the solution. The env needs to: test a dependent actions (each action is selected in a multidiscrete fashion, but it depends what action was chosen before. Additionally e.g some action may result into an action mask for future actions e.g choosing navigation action instead of some other high-level action). Also Ideally the policy will be recurrent (e2e). Problem is following: I had look at some repo like microRTS for python but it seems it won’t be trivial to also test recurrent policy feature as well (even though it’s good for action mask testing).

Checklist

maxmax1992 avatar Feb 05 '25 01:02 maxmax1992

Hello,

Unfortunately, I don't have any specific task in mind, you might have a look at https://github.com/Stable-Baselines-Team/stable-baselines3-contrib/issues/101 (and ask there).

araffin avatar Feb 07 '25 08:02 araffin