cuda-api-wrappers issues

Add the bandwidthtest CUDA sample as an example program

Let's add the bandwidthtest CUDA sample program to our modified CUDA samples.

task

resolved-on-development

Starting CUDA profiling: initialization error

11

Hello, the testcase `vectorAdd_profiled` doesn't work from the box for me: ```sh ./vectorAdd_profiled terminate called after throwing an instance of 'cuda::runtime_error' what(): Starting CUDA profiling: initialization error ``` after adding...

alsam

bug

resolved-on-development

scoped_existence_ensurer_t fails if the driver has not been initialized

Currently, scoped_existence_ensurer_t uses `context::current::detail_::get_handle()`, which assumes the driver has been initialized. To drop this assumption, we need to check the return status of the CUDA driver API call.

eyalroz

bug

resolved-on-development

Use the launch config builder in more examples

The launch config builder (#311 ) is awesome. Let's use it more! There are lots of examples which do this work themselves rather than just availing themselves of the builder.

eyalroz

task

resolved-on-development

config builder is missing a method for setting the device

We can query the device associated with a launch config builder object, but we're missing the method(s) for setting it.

eyalroz

bug

resolved-on-development

Check overall dimensions are not too large to be expressed with our grid and block dimension limits

The launch config builder accepts overall dimensions using `size_t` value. But - those may exceed what CUDA supports. So, we need to check the values are supported, at least in...

eyalroz

task

Support CUlibrary's and CUkernel's - CUDA driver low-level "library management"

Beginning with CUDA 12.0, we now have access to several functions for handling "libraries" of context-less "kernels": https://docs.nvidia.com/cuda/cuda-driver-api/group__CUDA__LIBRARY.html One can get a context-associated module or kernel by calling `cuKernelGetFunction()` or...

eyalroz

task

resolved-on-development

module proxy class doesn't need to hold its link options

It seems like the link options of a module are not used anywhere after its creation. Well, let's drop them then.

eyalroz

task

resolved-on-development

Move apriori_compiled_kernel_t into the kernel namespace

`apriori_compiled_kernel_t` -> `kernel::apriori_compiled_t` makes more sense... let's move it there. Also, move some functions into a `kernel::apriori_compiled` sub-namespace.

eyalroz

task

resolved-on-development

Structure 'attributes' and 'properties' better, using container-like facades and proxies

Many of our the objects we wrap in the library have all sorts of "attributes" or "properties", with API functions for getting and setting them. At the moment, we reflect...

eyalroz

enhancement

cuda-api-wrappers
cuda-api-wrappers copied to clipboard

Metadata

Add the bandwidthtest CUDA sample as an example program

Starting CUDA profiling: initialization error

scoped_existence_ensurer_t fails if the driver has not been initialized

Use the launch config builder in more examples

config builder is missing a method for setting the device

Check overall dimensions are not too large to be expressed with our grid and block dimension limits

Support CUlibrary's and CUkernel's - CUDA driver low-level "library management"

module proxy class doesn't need to hold its link options

Move apriori_compiled_kernel_t into the kernel namespace

Structure 'attributes' and 'properties' better, using container-like facades and proxies

← Metadata

Owner

Metadata

cuda-api-wrappers cuda-api-wrappers copied to clipboard

Metadata

← Metadata

Owner

Metadata

cuda-api-wrappers
cuda-api-wrappers copied to clipboard