rl-mpc-locomotion Observation mismatch

Observation mismatch

Open silvery107 opened this issue 1 year ago • 0 comments

The observation in training is mismatched with deployment. The base_pos should be removed.

I was trying to align the RL reward to MPC cost, but it turns out it's better to go without position tracking for both stages.

Oct 24 '24 15:10 silvery107