zhili

Results 5 comments of zhili

@jennloe We are running a 3D transient subsurface flow model. This test case has 40x24x32=30720 grid cells, resulting a 7-diagonal matrix. The tolerance is 1e-8 and it takes (more or...

Hi @jennloe , just wondering if you have figured out something? Do you have any suggestions on what we could try next? Thanks!

@srajama1 @lucbv Thanks for the suggestion! I have tried the simple-kernel-timer but I am not sure how to decode the output file (or perhaps I did something wrong?) For example,...

Ok I found the reader provided. This is useful indeed! Here are the top 5 time-consuming sections: with 1 thread: - SubsurfaceModel::computeFaceConductivity(SubsurfaceState&, DomainSubsurface&)::{lambda(int)#1} (ParFor) 1.248947 30 0.041632 29.454716 23.021248 -...

@srajama1 I am using Mac+gcc. I just use the default settings when building kokkos kernels. Do you mean I should add -DKokkosKernels_ENABLE_TPL_BLAS=ON when building?