Tobias Ribizel

Results 105 issues of Tobias Ribizel

This adds CUDA support to the compilation. Due to the structure of Ginkgo, this doesn't need to be a runtime dependency, as we can also use the stub library instead....

### Is this a duplicate? - [X] I confirmed there appear to be no [duplicate issues](https://github.com/NVIDIA/cccl/issues) for this bug and that I agree to the [Code of Conduct](CODE_OF_CONDUCT.md) ### Type...

bug

This adds CI jobs on a Windows gitlab-runner with an NVIDIA GPU

reg:ci-cd
1:ST:ready-for-review

To simplify the review of #1516, this does all the necessary changes to automate the majority of the other changes - Add necessary switching headers - Provide device namespace macro...

reg:build
reg:testing
mod:core
mod:cuda
mod:openmp
type:solver
type:preconditioner
type:matrix-format
1:ST:ready-for-review
mod:hip
type:factorization
type:multigrid
mod:dpcpp

This adds a function that prints a textual representation of the device used by an executor, and uses it in tests/benchmarks

reg:testing
reg:benchmarking
type:solver
type:preconditioner
type:matrix-format
1:ST:ready-for-review
type:factorization
type:reordering
type:multigrid
type:stopping-criteria
mod:all

This removes all stride support from #1582, which removes most of the heavy `iterator` functionality added in the other PR. It relies on `zip_iterator` to provide both enumerating `for (auto...

reg:build
reg:testing
mod:core
1:ST:ready-for-review

This requires C++17 support and removes the workarounds we had in place before

reg:build
reg:testing
reg:documentation
mod:core
mod:cuda
reg:example
type:solver
type:preconditioner
1:ST:ready-for-review
mod:hip
type:factorization
type:reordering
mod:dpcpp
1:ST:run-full-test

This replaces `std::tuple` by a custom `device_tuple` implementation to be used on the device. TODO - [ ] Merge #1603

reg:build
reg:testing
mod:core
mod:openmp
mod:reference
type:matrix-format
1:ST:ready-for-review
type:multigrid

This PR adds a bunch of utility abstractions for simplifying common iteration patterns in Ginkgo. 1. `irange`: Similar to the Python `range([start, ] stop[, step])` function, the `irange` provides a...

reg:build
reg:testing
mod:core

Operations like $\alpha A x + 0 y$ may propagate NaNs from y to the output despite the 0 coefficient. This can be avoided by checking the beta scaling factors...

reg:testing
mod:cuda
mod:openmp
mod:reference
type:matrix-format
mod:hip
mod:dpcpp