Alan Williams comments

Results 32 comments of


                                            Alan Williams

Timing mis-match when threading

I will try to reproduce the discrepancy on the local blade where I can run threaded and also use vtune for another view of where the time is being spent.

Timing mis-match when threading

I think I have successfully reproduced this on my local blade with the cvfemHC nightly test. When I run with 8 mpi procs and 1 thread per mpi rank, the...

Timing mis-match when threading

It looks like there is some double-counting of time somewhere. For the hoHelium case if I sum the times printed from dump_eq_time() on equation-system, the sum is greater than total...

Timing mis-match when threading

There is some double-counting of time in the equation-system timers. An example is LowMachEquationSystem::solve_and_update calls momentumEqSys_-> compute_projected_nodal_gradient(), and time is accumulated both inside and outside that call. In general, I...

STK: Snapshot 07-14-22 09:21

Replacing this with a newer snapshot.

Broken Perlmutter GPU build possibly due to STK changes in Trilinos

We did recently start using the MPI_CXX_BOOL type in a couple of cases in stk. I'll refer this to our MPI guru and see what he thinks. I wonder if...

Broken Perlmutter GPU build possibly due to STK changes in Trilinos

Sorry for not following up on this. This issue was fixed by this stk update: https://github.com/trilinos/Trilinos/pull/10914

Nalu-wind build times on ORNL Summit for GPU builds

@sayerhs That's great, I'm glad you were able to get the per-target build times. It looks like the ngp_algorithms files are the worst offenders. I think any compilation-unit with cuda...

Nalu-wind build times on ORNL Summit for GPU builds

@sayerhs One more thing: the stk ngp stuff could be a culprit, it is all header-only, and perhaps much of it doesn't need to be header-only. I'll look into that.

Nalu-wind build times on ORNL Summit for GPU builds

@sayerhs Just from browsing the code, I wonder if we could reduce build times by splitting NgpLoopUtils.h into several separate headers. For instance I see that many .C files call...