Simon Boehm
Results
33
comments of
Simon Boehm
1. Definitely makes sense to have D be the inner loop (this is just how minibatching works in deep learning). 2. When we first talked about this 2 weeks ago,...
https://github.com/siboehm/CPU_bench/blob/master/gosrc/bandwidth.go
I wouldn't use a different protocol, mostly because if we just MPI for everything we only have to do the MPI_World setup once. With a different protocol it'll get complicated...