stgeke

Results 89 issues of stgeke

I can think of two possible implementations:
1. A progress thread running in the background
2. A user-level thread (ULT) with network interrupts
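To make the first option concrete, here is a minimal sketch of a background progress thread. It assumes the goal is to advance pending non-blocking operations while the main thread computes. The names `ProgressThread`, `submit`, and `make_fake_operation` are hypothetical and come from no particular library. The fake operation only stands in for a real completion test on a communication request.

```python
import threading
import queue
import time


class ProgressThread:
    """Background thread that repeatedly polls pending operations for completion."""

    def __init__(self, poll_interval=0.001):
        self._pending = queue.Queue()
        self._stop = threading.Event()
        self._poll_interval = poll_interval
        self._thread = threading.Thread(target=self._run, daemon=True)
        self._thread.start()

    def submit(self, test_fn):
        """Register an operation; test_fn() returns True once it has completed."""
        done = threading.Event()
        self._pending.put((test_fn, done))
        return done

    def _run(self):
        active = []
        while not self._stop.is_set():
            # Pick up newly submitted operations without blocking.
            while True:
                try:
                    active.append(self._pending.get_nowait())
                except queue.Empty:
                    break
            # Poll each active operation; signal and drop the finished ones.
            still_active = []
            for test_fn, done in active:
                if test_fn():
                    done.set()
                else:
                    still_active.append((test_fn, done))
            active = still_active
            time.sleep(self._poll_interval)

    def shutdown(self):
        self._stop.set()
        self._thread.join()


def make_fake_operation(duration):
    """Hypothetical stand-in for a non-blocking request that completes after `duration` seconds."""
    deadline = time.monotonic() + duration
    return lambda: time.monotonic() >= deadline
```

The main thread calls `submit(...)`, continues with its own work, and waits on the returned event only when it needs the result. The polling interval trades latency against the CPU time taken from the compute threads, which is one reason the interrupt-driven ULT variant may be attractive.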

Currently, only the crystal router is supported. This PR proposes adding pairwise and all_reduce.
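As a rough illustration of how the two proposed methods differ, the sketch below simulates both in plain Python without any MPI calls. It assumes the methods are exchange strategies for summing values that share a global id across ranks, which the text above does not state explicitly. The function names and the data layout (one dict of `global_id -> value` per rank) are hypothetical.

```python
from collections import defaultdict


def exchange_pairwise(ranks):
    """Each rank sends its values directly to every other rank holding the same id."""
    owners = defaultdict(list)
    for r, data in enumerate(ranks):
        for gid in data:
            owners[gid].append(r)
    # inbox[r][gid] accumulates contributions received by rank r for id gid.
    inbox = [defaultdict(float) for _ in ranks]
    messages = 0
    for src, data in enumerate(ranks):
        # Group outgoing values by destination so each neighbor gets one message.
        outgoing = defaultdict(dict)
        for gid, val in data.items():
            for dst in owners[gid]:
                if dst != src:
                    outgoing[dst][gid] = val
        for dst, payload in outgoing.items():
            messages += 1
            for gid, val in payload.items():
                inbox[dst][gid] += val
    result = [
        {gid: val + inbox[r][gid] for gid, val in data.items()}
        for r, data in enumerate(ranks)
    ]
    return result, messages


def exchange_all_reduce(ranks):
    """Every rank packs its values into a dense buffer indexed by id; buffers are summed."""
    ids = sorted({gid for data in ranks for gid in data})
    slot = {gid: i for i, gid in enumerate(ids)}
    total = [0.0] * len(ids)
    for data in ranks:
        buf = [0.0] * len(ids)
        for gid, val in data.items():
            buf[slot[gid]] = val
        # This element-wise sum stands in for the all-reduce across ranks.
        total = [a + b for a, b in zip(total, buf)]
    return [{gid: total[slot[gid]] for gid in data} for data in ranks]


# Three simulated ranks, each sharing one id with each of the other two.
ranks = [{1: 1.0, 2: 2.0}, {2: 3.0, 3: 4.0}, {3: 5.0, 1: 6.0}]
pairwise_result, n_messages = exchange_pairwise(ranks)
all_reduce_result = exchange_all_reduce(ranks)
```

Both strategies produce the same sums. In this simulation the pairwise variant costs one message per neighbor pair, while the all_reduce variant costs a single collective over a buffer whose length is the number of distinct ids, so which one is preferable would depend on how sparse the sharing pattern is.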

Currently, the interpolation step is performed on the target rank, i.e. not on the source rank where the interpolation points are stored. For large ratios of source/target ranks the interpolation step...

So far, MPI benchmark packages do not assess the performance of sparse collectives. Possible benchmarks need to include:
* different message sizes and virtual topologies
* blocking and non-blocking performance

Just in case convergence of the residual stalls