pytorch_a3c
Asy_optimizer
Hi, thanks for this project. I have a question about the optimizer: is it enough to simply use the original optimizers provided by PyTorch? Before opt.step(), if we simply do something like:

opt = SGD(shared_para, lr=0.01)
for l_p, s_p in zip(local_para, shared_para):
    s_p.grad = l_p.grad  # copy the locally computed gradient onto the shared parameter
opt.step()

will this be enough? Thanks!
@ypxie Your code is applicable only to simple optimization methods that keep no internal state, like plain SGD. In the paper they used the RMSProp optimizer, which accumulates a running average of the squared gradients as per-parameter state. They say the asynchronous optimizer should share those accumulations across workers, so the original RMSProp optimizer cannot be used as-is.
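For illustration only (not code from this repo; the parameter w and the toy loss are made up), this is the per-parameter state that plain gradient copying would not share:

```python
import torch
import torch.optim as optim

# RMSprop keeps a running average of squared gradients per parameter in
# optimizer.state; each worker ends up with its own copy unless that
# tensor is explicitly placed in shared memory.
w = torch.nn.Parameter(torch.randn(3))
opt = optim.RMSprop([w], lr=1e-2)

w.sum().backward()  # toy loss, just to populate w.grad
opt.step()

print(opt.state[w]['square_avg'])  # the accumulation the paper says should be shared
```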
@rarilurelo Thanks for your reply. Can I simply move the state of the PyTorch optimizer into shared memory with share_memory_() and add a lock to protect it? That way, the copies of the optimizer in the different workers would be using the same state tensors. Does this sound sufficient? Thanks!
@ypxie Yes!
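A minimal sketch of that idea (not code from this repository: the class name SharedRMSprop, the update_shared helper, the hyperparameter values, and the mp.Lock usage are illustrative, and it assumes the stock torch.optim.RMSprop only initializes state lazily when the state dict is empty):

```python
import torch
import torch.multiprocessing as mp
import torch.optim as optim


class SharedRMSprop(optim.RMSprop):
    """RMSprop whose per-parameter state lives in shared memory, so every
    worker process accumulates into the same square_avg tensors."""

    def __init__(self, params, lr=7e-4, alpha=0.99, eps=0.1):
        super(SharedRMSprop, self).__init__(params, lr=lr, alpha=alpha, eps=eps)
        # Pre-create the state so the stock step() skips its lazy
        # initialization and keeps updating these tensors in place.
        for group in self.param_groups:
            for p in group['params']:
                state = self.state[p]
                state['step'] = torch.zeros(1)
                state['square_avg'] = torch.zeros_like(p.data)

    def share_memory(self):
        for group in self.param_groups:
            for p in group['params']:
                state = self.state[p]
                state['step'].share_memory_()
                state['square_avg'].share_memory_()


def update_shared(local_model, shared_model, opt, lock):
    # Copy local gradients onto the shared parameters, then step the
    # shared optimizer; the lock keeps the read-modify-write atomic.
    with lock:
        for l_p, s_p in zip(local_model.parameters(), shared_model.parameters()):
            s_p.grad = l_p.grad
        opt.step()
```

In this sketch, opt = SharedRMSprop(shared_model.parameters()), opt.share_memory(), and lock = mp.Lock() would be created once in the main process and passed to every worker, alongside a shared_model whose parameters have also been moved to shared memory.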