acme
acme copied to clipboard
Questions re: Custom Distributional Agents in ACME?
I have four questions towards creating custom distributional RL algorithms using Acme.
-
The only distributional agent I can find (in TF and jax) is D4PG. Is this correct?
-
When will other distributional agents (e.g. C51, Quantile Regression, Expectile Regression) be released?
-
If I want to implement my own distributional agent, I glanced at
d4pg.D4PGLearnerto get a sense of how this might be done. I'm a bit puzzled by a few things. Coming from PyTorch, I'm used to optimizers stepping. It looks like here, the Learner itself steps. Is this correct? What was the reason for this implementation choice? -
If I want to implement expectile regression for discrete control, what would the recommended approach be? I imagine I'd need to start with DQN, change the output dimension and change the loss function, but how do I specify which output element (i.e. the output units corresponding to tau = 0.5) is used for control?