cuda-api-wrappers issues

Add support for nvrtcGetSupportedArchs()

With CUDA 11.2, NVRTC added some API functions for determining the target architectures it supports. Let's add support for that.

eyalroz

task

resolved-on-development

Add support for the PTX compilation API

1

Somehow, NVIDIA's separate library for compiling PTX code into SASS escaped me... It's documented at: https://docs.nvidia.com/cuda/ptx-compiler-api/index.html and we should definitely add support for it. There's a "handle" type, similar to...

eyalroz

enhancement

task

Remove more CUDA runtime API uses when using the driver API mostly

1

While the driver wrappers branch has come a long way, it still uses some CUDA runtime API constants, types, and API calls. Some of this might be unavoidable, but many...

eyalroz

task

Support additional arbitrary NVRTC compilation options

There could always be additional NVRTC compilation options which are not explicitly supported. Let's make it possible to add those with no special parsing/combination/etc. - to simply be appended to...

eyalroz

task

resolved-on-development
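As an illustration of the "simply append" idea in the issue above, here is a minimal sketch. The `compilation_options` struct, its field names, and `render_options()` are hypothetical and are not taken from the library; `--device-debug` is used only as an example of an explicitly supported NVRTC option.

```cpp
#include <string>
#include <vector>

// Hypothetical options structure; the names are illustrative assumptions,
// not the library's actual interface.
struct compilation_options {
    bool generate_debug_info = false;        // an explicitly supported option
    std::vector<std::string> extra_options;  // passed through untouched
};

// Render the options as the list of strings NVRTC expects.
// Explicitly supported options come first; extra options are appended
// verbatim, with no parsing, validation, or de-duplication.
inline std::vector<std::string> render_options(const compilation_options& opts)
{
    std::vector<std::string> rendered;
    if (opts.generate_debug_info) {
        rendered.push_back("--device-debug");
    }
    rendered.insert(rendered.end(),
                    opts.extra_options.begin(), opts.extra_options.end());
    return rendered;
}
```

Passing the extra options through unmodified means that any option NVRTC gains in the future can be used without the wrapper knowing about it, at the cost of no checking for conflicts with the explicitly supported ones.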

Support for NVRTC diag-suppress/error/warn

With NVRTC, you can choose to either suppress, warn, or emit an error when encountering various issues in the code, using `--diag-error=`, `--diag-suppress=`, and `--diag-warn=`. This is currently not...

eyalroz

task

resolved-on-development
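To make the three diagnostic options from the issue above concrete, here is a rough sketch of how such an option string could be composed. The `diag_handling` enum and `make_diag_option()` are hypothetical names for illustration, not the library's interface; the sketch assumes each option takes a comma-separated list of diagnostic numbers.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical names, for illustration only.
enum class diag_handling { error, suppress, warn };

// Build a single NVRTC option of the form --diag-<kind>=<n1>,<n2>,...
inline std::string make_diag_option(diag_handling kind,
                                    const std::vector<int>& diagnostic_numbers)
{
    // An option with no diagnostic numbers would carry no meaning.
    assert(!diagnostic_numbers.empty());

    std::string option;
    switch (kind) {
    case diag_handling::error:    option = "--diag-error=";    break;
    case diag_handling::suppress: option = "--diag-suppress="; break;
    case diag_handling::warn:     option = "--diag-warn=";     break;
    }
    // Join the diagnostic numbers with commas.
    for (std::size_t i = 0; i < diagnostic_numbers.size(); ++i) {
        if (i > 0) { option += ','; }
        option += std::to_string(diagnostic_numbers[i]);
    }
    return option;
}
```

The diagnostic numbers are left as plain integers here; which numbers are meaningful depends on the NVRTC version in use.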

Switch memory functions to only work with regions (and spans)

7

Look at our modified vectorAdd example. It's certainly nicer than the original, but it's just sad that we have to repeat ourselves again and again with respect to lengths and...

eyalroz

question

task

Do something about cudaCpuDeviceId and cudaInvalidDeviceId

1

Some API calls may return `cudaCpuDeviceId` to indicate host memory as a location, or `cudaInvalidDeviceId` to indicate no single location. Right now, we are completely oblivious to these values -...

eyalroz

task
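One possible way to stop being oblivious to the two special values from the issue above is to translate a raw device ID into an explicit "location" before it reaches user code. The sketch below is only an assumption about how this might look: `location_kind`, `location`, and `interpret_location()` are hypothetical names, and the two constants are redefined locally, mirroring the values of `cudaCpuDeviceId` and `cudaInvalidDeviceId` in the CUDA headers, so that the sketch compiles without the CUDA toolkit.

```cpp
// Local stand-ins mirroring the CUDA header values, so that this sketch
// compiles without the toolkit installed.
constexpr int cpu_device_id     = -1;  // value of cudaCpuDeviceId
constexpr int invalid_device_id = -2;  // value of cudaInvalidDeviceId

// Hypothetical types, for illustration only.
enum class location_kind { device, host, none };

struct location {
    location_kind kind;
    int device_id;  // meaningful only when kind == location_kind::device
};

// Translate a raw ID returned by an API call into an explicit location,
// so that callers never mistake a special value for a real device index.
inline location interpret_location(int raw_id)
{
    if (raw_id == cpu_device_id)     { return { location_kind::host, raw_id }; }
    if (raw_id == invalid_device_id) { return { location_kind::none, raw_id }; }
    return { location_kind::device, raw_id };
}
```

Whether this belongs in a dedicated type or in the existing device-handling code is a design question the issue leaves open.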

Add NVIDIA's separate-compilation-linking example

Several years ago, the NVIDIA Technical blog / parallel-4-all published this piece: [Separate Compilation and Linking of CUDA C++ Device Code](https://developer.nvidia.com/blog/separate-compilation-linking-cuda-device-code/) and linked to an example repository: [separate-compilation-linking](https://github.com/NVIDIA-developer-blog/code-samples/tree/master/posts/separate-compilation-linking). It would...

eyalroz

task

Add explicit support for pitched linear memory

1

Now that we support CUDA arrays, and do some matter-of-fact dealing with pitched CUDA Runtime API calls, it's probably time we properly expanded that to pitched memory support. Pitched memory...

eyalroz

enhancement

missing-cuda-feature

Support CUDA execution graphs

12

Added with CUDA 10.0, [CUDA graphs](https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#cuda-graphs) (terribly named, as there are 7 "graph" things that CUDA provides) are a take on task graphs within CUDA's programming model. What's...

neoblizz

task
