Modifying dopamine to accept continuous action spaces
Could you possibly comment on the difficult of getting dopamine to work with continuous action spaces? Is this something that could be done with a bit of effort, or are the agents completely incompatible with this type of action space? I wanted to have a go getting dopamine to work with the OpenAI Gym BipedalWalker environment. I managed to get all of the initialization and setup working, then realized an assumption of discreet action spaces was hard-coded into dopamine.
It probably isn't easy, the agents are incompatible with these action spaces. This was probably the toughest choice for the first version – to support continuous actions or not. It would have significantly increased code complexity, so we decided to wait until we had a better sense of where to take Dopamine next.
Ok, thanks for this, I appreciate you taking the time to answer my question.