cuda-api-wrappers icon indicating copy to clipboard operation
cuda-api-wrappers copied to clipboard

Thin, unified, C++-flavored wrappers for the CUDA APIs

Results 151 cuda-api-wrappers issues
Sort by recently updated
recently updated
newest added

There are more APIs for working with textures, CUDA arrays and surfaces, than we support. While our array copying support has improved in recent years, our basic representation capability is...

missing-cuda-feature

Hi! Thank you for your great work. Do you have a call method with this function? It seems to be used in a different way in the warehouse. That's all...

Let's add comparison operators for classes which are missing them, like contexts - and quite a few others, it seems.

task

Let's add the matrixMulCUBLAS example program.

task
resolved-on-development

With CUDA 12.4, NVIDIA introduced [a new library for writing fatbin files](https://docs.nvidia.com/cuda/nvfatbin/index.html). We need to figure out how to support this (functions on other classes, separate-mostly library etc.) and then...

task
resolved-on-development

`cuda::poor_mans_option::value_or` should be a `const` method, and it isn't.

task
resolved-on-development

The different vectorAdd examples have different random value generators; and some of them generate `double`s... let's unify all that (and use the standard library, even though it makes the programs...

task
resolved-on-development

Our launch config methods for setting block size or grid size in blocks, which take a single value - take the "correct" type: `grid::dimension_t` or `grid::block_dimension_t`. That's a problem, since...

task
resolved-on-development

CUDA NVRTC programs have the API function: ``` nvrtcResult nvrtcGetLoweredName( nvrtcProgram prog, const char *const name_expression, const char** lowered_name); ``` which mangle a function's name (or its signature). Unfortunately: *...

task

unique_span deletion - destruction and de-allocation - is more complex than one might think. With trivially-constructible types, we can supposed just use `new T[size]` to start off and then `delete[]`...

task
resolved-on-development