Jean-Luc Fattebert

Results 10 comments of Jean-Luc Fattebert

Sorry, I was using an older commit. Here is the error call stack with the latest master commit: [narsil-gpu2:1602573:0:1602573] Caught signal 11 (Segmentation fault: address not mapped to object at...

This is what cuda-gdb gives me: signal SIGSEGV, Segmentation fault. 0x000015552097d533 in matCopy (n=38315, m=1, Dst=0x1553f7895c80, lddst=38315, Src=0x2a368b70, ldsrc=38315) at /home/q8j/GIT/superlu_dist/SRC/dnrformat_loc3d.c:34 34 Dst[i + lddst * j] = Src[i +...

i=0, j=0, Src[0]=-7.5099551745997222e-13, (cuda-gdb) print Dst[0] Cannot access memory at address 0x1553f7895c80

After further investigations, it looks like the issue is related to the interaction between superlu and Kokkos or Trilinos which my application is using. A stand alone driver runs fine....

@egboman This is what I do. Here are a few details about the code that fails (in call to pdgssvx3d): void SuperLUSolver::solve(const Tpetra::MultiVector& b, Tpetra::MultiVector& x) { // copy b...

@xiaoyeli Printed out values of xv before superlu call look fine (I have not tried to do it inside superlu). I can even copy those values into a simple C++...

I can reproduce the issue on Summit at OLCF. SuperLU build: ---------------------------------------------------------------------------- #!/bin/bash module load gcc/9.3.0 module load parmetis/4.0.3 module load metis module load cuda module load cmake module load...

That should help. Thanks!

The preferred option is to use cmake to build the main code, and autoconf for the base libraries. It may actually be the only one working. So autodonf scripts are...