Mava
Mava copied to clipboard
Feature/make continuous ppo
What?
Added continuous network option for IPPO.
Why?
Extending functionality for IPPO implementation.
How?
Added function to create IPPO network with a clipped Gaussian function instead of a categorical policy.
Extra
Closes issue #631
-~~The get_action(...)
function needs an if statement to check the network type in use since the discrete actions are of type int64
and continuous actions are of type float32
~~
- ~~Need to make custom networks to clip distribution. Functionality is present in
acme.tf
but notacme.jax
~~ - NaN's generated in Multiwalker
@siddarthsingh1 please create and link an issue for this within the correct milestone (IPPO)
We are closing this PR as we are focussing on the JAX release. We will re-open it when needed again in the future.