zhili
zhili
@jennloe We are running a 3D transient subsurface flow model. This test case has 40x24x32=30720 grid cells, resulting a 7-diagonal matrix. The tolerance is 1e-8 and it takes (more or...
Hi @jennloe , just wondering if you have figured out something? Do you have any suggestions on what we could try next? Thanks!
@srajama1 @lucbv Thanks for the suggestion! I have tried the simple-kernel-timer but I am not sure how to decode the output file (or perhaps I did something wrong?) For example,...
Ok I found the reader provided. This is useful indeed! Here are the top 5 time-consuming sections: with 1 thread: - SubsurfaceModel::computeFaceConductivity(SubsurfaceState&, DomainSubsurface&)::{lambda(int)#1} (ParFor) 1.248947 30 0.041632 29.454716 23.021248 -...
@srajama1 I am using Mac+gcc. I just use the default settings when building kokkos kernels. Do you mean I should add -DKokkosKernels_ENABLE_TPL_BLAS=ON when building?