Multiclip reward reaches a plateau

Open tfederico opened this issue 4 years ago • 2 comments

Hello,

I tried training the character with the multiclip reward as described in the paper. However, the reward reaches a plateau and the character limps.

Do you have any suggestions about why this might be happening? Which rewards did you use to train the multiclip policy: the ones in the code or the ones in the paper?

tfederico avatar Sep 24 '20 10:09 tfederico

The reward in the code should work for imitating multiple walking clips. It's a bit hard to tell what might be going wrong. But as a first guess, make sure that all the reference motions are synchronized: their durations should all be scaled to the same length, so that a single phase variable is valid for every motion.

xbpeng avatar Sep 27 '20 01:09 xbpeng

I think I am fine with that. Have you ever tried to use the same reward for multiple clips that were different from walking? (e.g., moving arms)

tfederico avatar Sep 30 '20 11:09 tfederico