89 issues of stgeke
I can think of two possible implementations:
1. Progress thread running in the background
2. User-level thread (ULT) with network interrupts
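The first option, a background progress thread, could look roughly like the sketch below. It is only an illustration under stated assumptions: `ProgressThread`, `submit`, and `make_fake_request` are hypothetical names, and each "request" is simulated by a callable that reports completion after a few polls, standing in for whatever completion test the real communication layer provides.

```python
import queue
import threading
import time


class ProgressThread:
    """Hypothetical background progress engine (not an existing API).

    It repeatedly polls pending operations so that they complete even
    while the main thread is busy computing.
    """

    def __init__(self, poll_interval=0.001):
        self._pending = queue.Queue()
        self._stop = threading.Event()
        self._poll_interval = poll_interval
        self._thread = threading.Thread(target=self._run, daemon=True)
        self._thread.start()

    def submit(self, test_fn):
        """Register an operation; test_fn() must return True once it is done."""
        done = threading.Event()
        self._pending.put((test_fn, done))
        return done

    def _run(self):
        while not self._stop.is_set():
            try:
                test_fn, done = self._pending.get(timeout=self._poll_interval)
            except queue.Empty:
                continue
            if test_fn():
                done.set()  # signal completion to whoever waits on it
            else:
                # Not finished yet: requeue and back off briefly.
                self._pending.put((test_fn, done))
                time.sleep(self._poll_interval)

    def shutdown(self):
        self._stop.set()
        self._thread.join()


def make_fake_request(polls_needed):
    """Assumed stand-in for a real request: completes after N polls."""
    state = {"left": polls_needed}

    def test():
        state["left"] -= 1
        return state["left"] <= 0

    return test
```

A caller would `submit` a request, continue with local work, and call `wait()` on the returned event only when the result is needed. The ULT variant would replace the polling loop with a handler woken by a network interrupt, which this sketch does not attempt to model.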
Currently only the crystal router is supported. This PR proposes adding pairwise and all_reduce.
Currently the interpolation step is performed on the target rank, i.e., not on the source rank where the interpolation points are stored. For large ratios of source/target ranks the interpolation step...
So far, MPI benchmark packages do not assess the performance of sparse collectives. Possible benchmarks need to include:
* different message sizes and virtual topologies
* blocking and non-blocking performance
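The parameter space listed above can be enumerated as a simple sweep. The sketch below only builds the list of cases and does not run any MPI code; the function name `benchmark_cases`, the dictionary keys, and the topology labels are all hypothetical placeholders.

```python
from itertools import product


def benchmark_cases(message_sizes, topologies,
                    modes=("blocking", "non-blocking")):
    """Return one case per (message size, topology, mode) combination."""
    return [
        {"message_bytes": size, "topology": topo, "mode": mode}
        for size, topo, mode in product(message_sizes, topologies, modes)
    ]


# Example sweep; the topology names are illustrative labels only.
cases = benchmark_cases([8, 1024], ["ring", "3d-stencil"])
```

Each case would then be handed to a driver that builds the corresponding virtual topology and times the sparse collective in the requested mode.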
Just in case convergence of the residual stalls