rl-mpc-locomotion icon indicating copy to clipboard operation
rl-mpc-locomotion copied to clipboard

Observation mismatch

Open silvery107 opened this issue 1 year ago • 0 comments

The observation in training is mismatched with deployment. The base_pos should be removed.

I was trying to align the RL reward to MPC cost, but it turns out it's better to go without position tracking for both stages.

silvery107 avatar Oct 24 '24 15:10 silvery107