kokkos
kokkos copied to clipboard
Kokkos C++ Performance Portability Programming Ecosystem: The Programming Model - Parallel Execution and Memory Abstraction
This PR advances the integration of the OpenACC backend into kokkos. It introduces `parallel_for` and `parallel_reduce` for `RangePolicy` and bumps the incremental tests to level 5. Pedro did the implementation...
# Contains team-level implementations for: ## NonModifying sequence algorithms - `count_if` ## Modifying sequence algorithms (complete) - `fill`, `fill_n`, `replace`, `replace_if`, `replace_copy`, `replace_copy_if`, `copy`, `copy_n`, `copy_if`, `copy_backward`, `transform`, `generate`, `generate_n`,...
This PR adds `KOKKOS_ENABLE_IMPL_MDSPAN` and `KOKKOS_ENABLE_EXTERNAL_MDSPAN` flags. Additionally, mdspan sources are bundled as a tpl. This is ongoing work for #4421 Note: the license on the mdspan sources caused a...
Greetings, I was trying to compile Trilinos with CUDA and OpenMPI using the following: export OMPI_CXX=${trilinosdir}/packages/kokkos/bin/nvcc_wrapper cmake -DCMAKE_BUILD_TYPE=RELEASE -DTrilinos_MUST_FIND_ALL_TPL_LIBS=TRUE -DTPL_ENABLE_MPI=ON -DCMAKE_CXX_STANDARD:STRING=17 -DTrilinos_ENABLE_Kokkos=ON -DTPL_ENABLE_CUDA=ON -DKokkos_ENABLE_CUDA_UVM=OFF -DTrilinos_ENABLE_OpenMP=OFF -DTrilinos_ENABLE_Amesos2=ON -DTrilinos_ENABLE_Belos=ON -DTrilinos_ENABLE_MueLu=ON -DTrilinos_ENABLE_ROL=ON -DTrilinos_ENABLE_Ifpack2=ON...
This occurs with both version 3.6.01 and well as the master branch: ``` cmake .. -DCMAKE_CXX_COMPILER=clang++ -DCMAKE_INSTALL_PREFIX=`pwd`/../kokkos_install -DKokkos_ENABLE_OPENMP=On -DKokkos_ARCH_HSW=On -DKokkos_ENABLE_TESTS=On make -j install OMP_PROC_BIND=TRUE OMP_NUM_THREADS=16 ./core/unit_test/KokkosCore_UnitTest_OpenMP --gtest_color=no --gtest_filter=openmp.task_fib_single ... Note:...
- Keeps `partiton_master` deprecated. - Introduces `Kokkos_ENABLE_DISABLED_CODE_4`.
Trying #2024 on top of #5277. Fixes #1873.
(Addresses #645) Add sort functions that can be called from device, and exploit team and thread level parallelism. The new functions use bitonic sort, which is good for this because...
Tentative fix for https://github.com/kokkos/kokkos/issues/5320#issuecomment-1209623403
This pull request removes support for the dummy `std::integral_constant` `ViewCtor` properties and replaces the mechanism to add properties (that weren't existent already) with a separate function. This avoids writing much...