Simon Boehm

Results 33 comments of Simon Boehm

1. Definitely makes sense to have D be the inner loop (this is just how minibatching works in deep learning). 2. When we first talked about this 2 weeks ago,...

https://github.com/siboehm/CPU_bench/blob/master/gosrc/bandwidth.go

I wouldn't use a different protocol, mostly because if we just MPI for everything we only have to do the MPI_World setup once. With a different protocol it'll get complicated...