Luc Berger

Results 228 comments of Luc Berger

I am looking into it, unfortunately I found something unrelated at the moment so I'll fix that first. Hopefully I can reproduce your issue soon!

FYI, I am having a hard time finding an installation of cuda 11.8, but I found a machine with 11.7 so I'll give that a try. Might not have too...

Still on it but it is taking longer than I hoped.

Sure that sounds reasonable. I would like to point out that some of the features listed above are implemented (partially and/or fully) here: https://github.com/kokkos/kokkos-tools/blob/ecp-kpp3/profiling/kpp3-verifier/kp_ecp_kpp3.cpp see this function for example: extract_gpuinfo()

Maybe we should lift some of the code in `kokkos/core/src/impl/Kokkos_Command_Line_Parsing.cpp`? : )

Updating the list a bit based on things we already have implemented: ``` POTRF POTRS PTEQR GEQRF --> MueLu has also wanted this for a while GETRF GETRS --> I...

@eeprude maybe start by looking at the dependency graph of these functions and see if we have all the necessary BLAS calls required to implement them. If not we should...

@seanofthemillers I'm at SC22 this week, will look at this when I get back

@seanofthemillers @brian-kelley Is it fair to say that this PR has been superseded by recent work that was merged in Kokkos Kernels on integrating TPLs and redesigning our interfaces for...

@ndellingwood so with cuda 12.4 we have the `batched_dla_cuda` and `batched_gemm_cuda` working correctly? Anything else failing on that platform?