cuda-api-wrappers issues

Let the user supply a string buffer for the compilation log

Let's not force an extra copy of the log; instead, let the user pass in a reference to a buffer of their own somehow.

eyalroz

enhancement

resolved-on-development

nvrtc
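One possible shape for this, as a minimal sketch rather than the library's actual interface: the caller owns a `std::string` and the getter fills it in place. `fake_program` and `get_log` are hypothetical names; `fake_program` only stands in for the two NVRTC calls named in its comments so that the sketch runs without CUDA. It assumes the reported log size counts the terminating '\0'.

```cpp
#include <cassert>
#include <cstddef>
#include <cstring>
#include <string>

// Hypothetical stand-in for a compiled program. With real NVRTC, the two
// members would correspond to nvrtcGetProgramLogSize() and nvrtcGetProgramLog().
struct fake_program {
    std::string stored_log;
    // Assumption: the reported size counts the terminating '\0'.
    std::size_t log_size() const { return stored_log.size() + 1; }
    void copy_log(char* dest) const {
        std::memcpy(dest, stored_log.c_str(), stored_log.size() + 1);
    }
};

// Fills a caller-owned string, so the caller decides where the log lives
// and no second container is created just to return it.
void get_log(const fake_program& program, std::string& buffer)
{
    const std::size_t size_with_nul = program.log_size();
    assert(size_with_nul >= 1);
    buffer.resize(size_with_nul);      // room for the log and its '\0'
    program.copy_log(&buffer[0]);
    buffer.resize(size_with_nul - 1);  // keep the '\0' out of the string's contents
}
```

A caller that compiles many programs could keep one `std::string` around and pass it to each `get_log()` call.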

Support memory pools (introduced in CUDA 11.2)

1

CUDA 11.2 introduced a "memory pool" mechanism; we should support it. Full API documentation is [here](https://docs.nvidia.com/cuda/cuda-runtime-api/group__CUDART__MEMORY__POOLS.html).

```cpp
cudaError_t cudaMallocFromPoolAsync ( void** ptr, size_t size, cudaMemPool_t memPool, cudaStream_t stream )
cudaError_t cudaMemPoolCreate...
```

eyalroz

task

Support asynchronous memory allocation

2

CUDA 11.2 added asynchronous memory allocation and de-allocation. Let's support that. API description: [here](https://docs.nvidia.com/cuda/cuda-runtime-api/group__CUDART__MEMORY__POOLS.html).

```
cudaError_t cudaFreeAsync ( void* devPtr, cudaStream_t hStream );
cudaError_t cudaMallocAsync ( void** devPtr, size_t size,...
```

eyalroz

task

resolved-on-development

Integrate or supersede the functionality in NVIDIA's jitify

3

NVIDIA's [jitify](https://github.com/NVIDIA/jitify) library provides a C++-ish interface to (some of? all of?) the run-time compilation / JIT compilation facilities NVIDIA provides. This library should provide this functionality, in particular; and...

eyalroz

enhancement

task

When rendering compilation options to a string, we get an extra space

When we render compilation options into a string, it ends with an extra space. To fix this, we'll probably need to write startout instead of endopt, and have that have...

eyalroz

bug

resolved-on-development
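A common way to avoid the trailing separator, sketched here with a hypothetical `render_options()` helper that is not the library's rendering code, is to emit the separator before every option except the first instead of after each one:

```cpp
#include <cassert>
#include <sstream>
#include <string>
#include <vector>

// Hypothetical helper: writes the space *before* each option other than the
// first, so the rendered string never ends with a separator.
std::string render_options(const std::vector<std::string>& options)
{
    std::ostringstream oss;
    bool first = true;
    for (const auto& opt : options) {
        assert(!opt.empty() && "an empty option would render as a double space");
        if (!first) { oss << ' '; }
        oss << opt;
        first = false;
    }
    return oss.str();
}
```

With this ordering, an empty option list renders as an empty string and a single option renders with no surrounding whitespace.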

A builder-class for NVRTC programs

It would be useful if one could build NVRTC programs incrementally, adding and setting headers, options, etc. at one's convenience rather than when constructing a `program_t` object.

eyalroz

enhancement

resolved-on-development
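To illustrate the idea, here is a rough sketch of what incremental construction could look like. Every name in it (`program_builder`, `program_description`, and their members) is hypothetical and not taken from the library; the sketch only gathers the pieces and leaves out the actual NVRTC calls so that it runs without CUDA.

```cpp
#include <cassert>
#include <string>
#include <utility>
#include <vector>

// All names here are hypothetical; this is not the library's actual API.
struct program_description {
    std::string name;
    std::string source;
    std::vector<std::pair<std::string, std::string>> headers; // (name, contents)
    std::vector<std::string> options;
};

class program_builder {
public:
    explicit program_builder(std::string name) { desc_.name = std::move(name); }

    // Each setter returns *this, so calls can be chained or spread out.
    program_builder& source(std::string text) {
        desc_.source = std::move(text);
        return *this;
    }
    program_builder& add_header(std::string name, std::string contents) {
        desc_.headers.emplace_back(std::move(name), std::move(contents));
        return *this;
    }
    program_builder& add_option(std::string option) {
        desc_.options.push_back(std::move(option));
        return *this;
    }

    // A real builder would hand these pieces to nvrtcCreateProgram() and
    // nvrtcCompileProgram() here; this sketch only returns what it gathered.
    program_description build() const {
        assert(!desc_.source.empty() && "a program needs source text");
        return desc_;
    }

private:
    program_description desc_;
};
```

The point of the design is that options and headers can be added in any order, at different places in the calling code, before anything is created or compiled.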

Compilation log vector<char> contains trailing '\0'

It seems `nvrtcGetProgramLogSize()` includes 1 for a trailing '\0' character, and so we end up placing it in our return value - which is not a C-style string. Let's not...

eyalroz

bug

resolved-on-development
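A small sketch of the trimming step, assuming (as the description suspects) that the size reported for the log counts one terminating '\0'. `without_trailing_nul()` is a hypothetical helper, not a function of the library:

```cpp
#include <cassert>
#include <vector>

// Hypothetical helper: `raw` holds exactly as many bytes as the size query
// reported, i.e. the log text followed by its terminator.
std::vector<char> without_trailing_nul(std::vector<char> raw)
{
    // A size that counts the terminator is never zero, so an empty input
    // would point at a bug in the caller.
    assert(!raw.empty());
    if (raw.back() == '\0') { raw.pop_back(); }
    return raw;
}
```

The check on `raw.back()` keeps the helper harmless if the buffer turns out not to end in '\0' after all.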