mppi_pendulum icon indicating copy to clipboard operation
mppi_pendulum copied to clipboard

Sample from approximate dynamics

Open LemonPi opened this issue 4 years ago • 5 comments

Are there any plans to extend this to approximated dynamics (e.g. with a NN) and using importance sampling instead of sampling trajectories directly from the environment? (replace __init__ env arg with a dynamics arg, then take in env for just the control method for actual stepping)

That would actually match the contributions from the 2017 paper and make it more broadly applicable. I would like to use this in an environment where I can't reset the state of the simulator, so trajectories have to be generated with the model.

LemonPi avatar Dec 25 '19 20:12 LemonPi