cuda-api-wrappers icon indicating copy to clipboard operation
cuda-api-wrappers copied to clipboard

Thin, unified, C++-flavored wrappers for the CUDA APIs

Results 151 cuda-api-wrappers issues
Sort by recently updated
recently updated
newest added

We include ``, `` and `` too many times - when we have already necessarily included them earlier. Let's reduce these redundancies.

task
resolved-on-development

Currently, the `cuFuncSetAttribute()` can be called only in a member function of `cuda::kernel`. But - there's no good reason it shouldn't be usable outside of that object. Specifially, I'd like...

enhancement
task
resolved-on-development

Our use of `validate_all_dimensions_compatibility()`, realized in the fix for #561, only occurs in debug mode (when NDEBUG is undefined); but I forgot to surround the definition of this function itself...

task
resolved-on-development

It is currently impossible to retrieve the link options from a module object :-(

task
resolved-on-development

Compatibility is broken at least for 9.0 and 9.2, possibly additional CUDA versions.

bug
Regression
resolved-on-development

It seems we define `cuda::module::handle_t` twice; let's drop the second definition.

task
resolved-on-development

It is difficult to remember whether our main target name is runtime-and-driver, or driver-and-runtime. Let's add an alias so that they both mean the same thing.

task
resolved-on-development

In cuda::kernel_t::set_attribute, we have the line: ``` throw(cuda::runtime_error {cuda::status::not_yet_implemented}); ``` which should be in the #else block of the #endif for CUDA_VERSION >= 900 , but - it's in the...

bug
resolved-on-development

Since CUDA 12, the driver finally supports a proper launch configuration object, with a bunch of flags and features: ``` CUresult cuLaunchKernelEx (const CUlaunchConfig* config, CUfunction f, void** kernelParams, void**...

task

Building with CUDA 11.2, and with driver 535.54.03, and on a machine with two Quadro RTX 6000 cards, getting: ``` ----- vectorAddMMAP ----- Vector Addition (using virtual memory mapping) terminate...

bug