DLA-Future icon indicating copy to clipboard operation
DLA-Future copied to clipboard

Distributed Backtransform Tridiagonal - Check intra-epoch order performance

Open albestro opened this issue 2 years ago • 0 comments

An implementation detail of the distributed back transform tridiagonal algorithm has been chosen without performing any check.

Operations are already scheduled scanning the matrix with a smart ordering by so called "epochs", but it has been deliberately imposed an order for the intra-epoch loop.

We don't expect this cause any impact, but it is better to check that we are not missing anything that could be relevant performance-wise.

Originally posted by @albestro in https://github.com/eth-cscs/DLA-Future/pull/633#discussion_r980214302

albestro avatar Sep 26 '22 16:09 albestro