JackAKirk comments

Results 147 comments of


                                            JackAKirk

[LAPACK][CUSOLVER] Add potrf and getrs batch functions to cuSolver

> @AidanBeltonS My logs were from a build of llvm from 3/19, and it looks like the multistream patch was committed on 5/17, but for sanity I'll try building from...

add queue.wait() to sync interop stream

> Would this achieve asynchronous submissions? > Actually I think just from the observation that it fixes the test failures it must be effectively blocking future submissions (at least those...

add queue.wait() to sync interop stream

I've now added blocking waits (using `queue.wait()`) to all places where they were missing. I think that this is required for generally correct behaviour: we should block all streams in...

add queue.wait() to sync interop stream

I've found out that these cusolver functions are apparently asynchronous, even though the Nvidia documentations implies that they are synchronous: therefore I think that `depends_on` is behaving correctly. Also I've...

add queue.wait() to sync interop stream

> I've found out that these cusolver functions are apparently asynchronous, even though the Nvidia documentations implies that they are synchronous: therefore I think that `depends_on` is behaving correctly. Also...

add queue.wait() to sync interop stream

@AidanBeltonS could you check this is all OK? Thanks

update the matrix spec based on new use argument

By the way, I corrected a lot of typos that are still relevant for the new document here: https://github.com/intel/llvm/pull/6525/files Basically if you ignore the new sections I added there (####...

update the matrix spec based on new use argument

Looks pretty good overall. I just made some pretty simple suggestions.

update the matrix spec based on new use argument

> > > You have this inconsistency throughout the spec. In many places, you refer to the three matrices as a, b, and c. However, the use enum refers to...

update the matrix spec based on new use argument

LGTM. I just suggested some small changes.