KungFu
KungFu copied to clipboard
A question about Horovod central coordinator in the paper of KungFu
The asynchronous collective communication layer also avoids having an expensive central coordinator, as used for invoking synchronous collective communication operations inexisting systems, such as Horovod.
I see the paper of Horovod and KongFu,I wonder why does Horovod use the central coordinator,I havent find it in the paper of Horovod.Could you please give me some information about it?Such as some codes.I want to compare the difference.
Thanks!Have a nice day!
is this what you are looking for https://github.com/horovod/horovod/blob/master/horovod/common/operations.cc#L359-L378
is this what you are looking for https://github.com/horovod/horovod/blob/master/horovod/common/operations.cc#L359-L378
Thanks! I see the AD-PSGD algorithm in codes.Does it relate to the collective communication layer noted in the paper?