Christian Trott
Christian Trott
I think this will be covered by the mdspan transition soon (hopefully).
So are you saying it works with something older than 2024.1.0 and it works with 2024.2.0?
I did some compile time check and this new thing wins over typeid by a mile: ```c++ #include #include template bool instantiate_one() { #ifdef TYPEID auto s = typeid(Kokkos::View).name(); return...
NVCC 11.7 its 7.7s vs 1m37s
We may wanna organize a meeting for a larger discussion.
Closing: we anyway moved away again from cudaMallocAsync due to too much problems downstream since it doesn't allow IPC. We will revisit memory pools as an explicit feature.
Add the cmake option.
I do not understand how this does what you want with respect to pitched allocations?
I think the accessor needs to store a pitch or some information that every K elements there are 4 extra bytes or whatever.
Also your argument with the alignment doesn't make much sense to me. The object can't say that it is 12byte aligned anyway for pitched allocations, since the whole purpose is...