DeepMimic Multiclip rewards reaches a plateau

Open tfederico opened this issue 4 years ago • 2 comments

Hello,

I tried training the character with the multiclip reward as described in the paper. However, the reward reaches a plateau and the character limps.

Do you have any suggestions about why this might be happening? Which rewards did you use to train with multiple clips: the ones in the code or the ones in the paper?

Sep 24 '20 10:09 tfederico

The reward in the code should work for imitating multiple walking clips. It's hard to tell what might be going wrong, but as a first guess, make sure that all the reference motions are synchronized. That is, their durations should all be scaled to the same length, so that a single phase variable is valid for every motion.

Sep 27 '20 01:09 xbpeng
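
The synchronization advice above can be sketched as follows. This is only an illustration, not code from the DeepMimic repository: it assumes each clip is a list of frames whose first entry is that frame's duration in seconds, and that a clip's length is the sum of those durations. The helper names (`clip_duration`, `rescale_clip`, `synchronize_clips`) are hypothetical.

```python
from typing import List, Optional

Frame = List[float]  # assumed layout: [frame_duration_seconds, pose values...]


def clip_duration(frames: List[Frame]) -> float:
    """Total clip length, assuming it is the sum of per-frame durations."""
    return sum(frame[0] for frame in frames)


def rescale_clip(frames: List[Frame], target_duration: float) -> List[Frame]:
    """Return a copy of the clip with frame durations scaled uniformly
    so that the clip lasts target_duration seconds. Poses are unchanged."""
    total = clip_duration(frames)
    if total <= 0.0:
        raise ValueError("clip has non-positive duration")
    scale = target_duration / total
    return [[frame[0] * scale] + list(frame[1:]) for frame in frames]


def synchronize_clips(clips: List[List[Frame]],
                      target_duration: Optional[float] = None) -> List[List[Frame]]:
    """Scale every clip to the same duration so one phase variable
    (time / duration) means the same thing for all of them.
    Defaults to the mean duration of the input clips."""
    if target_duration is None:
        target_duration = sum(clip_duration(c) for c in clips) / len(clips)
    return [rescale_clip(c, target_duration) for c in clips]


# Toy clips with one pose value per frame (hypothetical data).
walk_a = [[0.5, 1.0], [0.5, 2.0]]
walk_b = [[0.25, 3.0], [0.25, 4.0], [0.5, 5.0]]
synced = synchronize_clips([walk_a, walk_b], target_duration=2.0)
```

If your loader measures clip length differently (for example, by ignoring the last frame's duration), adjust `clip_duration` to match. Note also that equal durations alone do not align the gait cycles: the clips should start at the same point in the cycle as well.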

I think I am fine with that. Have you ever tried using the same reward for multiple clips of motions other than walking (e.g., moving the arms)?

Sep 30 '20 11:09 tfederico
