Erik Kluzek
Erik Kluzek
We should be able to show this as b4b-dev, so will likely rebase to b4b-dev, once we can show that.
@johnmauff and I worked on the code to get the mpi_scan testing working. I need to fill out a few more things and make sure it all works, but this...
OK, I have the new code functional now, and tried it out with mpas3p75. Here's timing results, which does improve, clm_init2 was 42 sec. ``` clm_init2 10240 10240 1 26.3516...
> Erik, I am confused here. I thought that the huge initialization time we were attempting to eliminate was in decomp_and_domain_from_readmesh. What impact did the MPI_scan have on that section...
It looks like I spent something on the order of a total week of my software development time over the last two weeks.
OK, we are able to show that the new code reduces the initialization time spent in decompInit_lnd for the mpasa3p75 grid with 40k processors from 1972 seconds to 13.5 seconds.
Most of the credit really goes to @johnmauff as he was the one that knew what to do. And that was the big key in doing this work. But, I...
> Hey Erik. I appreciate all of your work on the SIF project. How close are you to being able to wrap it up? I'm working towards getting the branches...
0.2 weeks in sprint 23.
From meeting with @johnmauff @briandobbins and @wwieder and I we decided I will cherry-pick just the updates to the decomp files (so just three source files). And bring that in...