DeepMimic icon indicating copy to clipboard operation
DeepMimic copied to clipboard

Is DeepMimic be trained using A3C or A2C?

Open Zju-George opened this issue 5 years ago • 1 comments

A3C: aka Asynchronous Advantage Actor Critic

It uses MPI, so I wonder if DeepMimic be trained using A3C?

Zju-George avatar Aug 27 '19 09:08 Zju-George

Neither, we are using PPO for training. The implementation with MPI is using synchronous updates, so it's more akin to A2C.

xbpeng avatar Aug 28 '19 00:08 xbpeng