Daniel Arndt

Results 829 comments of Daniel Arndt

Note that `m_tile` is public as of now (which we might change).

I think we mostly cared about the optimization in case the view count was 1 and restricting it to that case sounds sensible to me.

For ```C++ #include constexpr int step_width = 16; template value_type team_reduce_v0(sycl::nd_item item, value_type value) { auto sg = item.get_sub_group(); const auto sub_group_range = sg.get_local_range()[0]; const auto vector_range = 1; const...

The latest results for ```C++ #include constexpr int step_width = 16; #define DO_SHIFT_01(SHIFT) \ if (vector_range * SHIFT < sub_group_range)\ {\ const value_type tmp = sg.shuffle_down(value, vector_range * SHIFT); \...

> @masterleinad can you add a changelog entry for this update to the 4.4 tracker #6914 ? done

> So, when I specify both Kokkos_ENABLE_HIP and Kokkos_ENABLE_OPENMP, the openmp flags are added after the HIP flags. Where is that order specified and is there a way to switch...

> Ok, but I could not find the place where -fopenmp is added to the kokkoscore target COMPILE_OPTIONS. `OpenMP` flags are not added to `COMPILE_OPTIONS` since we use `target_link_libraries` for...

https://github.com/kokkos/kokkos/pull/5473#issuecomment-1299102792 is related.