Paul Richmond comments

Results 13 comments of


                                            Paul Richmond

Question: Are cuModules shared between kernels from same program

Thanks for the reply @benbarsdell. This is certainly an issue for us, particularly when it comes to constant memory. We have a number of large constant and statically sized device...

Question: Are cuModules shared between kernels from same program

@maddyscientist Yes this might work so long as you can link multiple kernels against the same module (containing the constant definition). Presumably this is fine as they are in the...

Question: Are cuModules shared between kernels from same program

@benbarsdell Yes I imagine that you are right as after linking there would be multiple modules with duplicate definitions of the constant. To set the constant value would require doing...

Question: Are cuModules shared between kernels from same program

@benbarsdell We have a work around for this for now but it would be a nice feature to enable instantiation of multiple kernels from the same module.

Can't build with NVCC option '--Werror cross-execution-space-call' on Windows

There is also an issue with assignment in a conditional statement. I have issued a PR (#63) for that but am not sure how to fix the issue above.

Unified memory for device oversubscrption

External benchmark use is evident at [QMUL](https://blog.hpc.qmul.ac.uk/benchmarking-grace-hopper-nodes/) and internally on the [RSE Blog](https://rse.shef.ac.uk/blog/2023-08-18-benchmarking-flamegpu2-on-h100-a100-and-v100-gpus/). This issue will be shared to capture any feedback on further benchmark ideas that could measure host...

Paul Richmond

Question: Are cuModules shared between kernels from same program

Question: Are cuModules shared between kernels from same program

Question: Are cuModules shared between kernels from same program

Question: Are cuModules shared between kernels from same program

Can't build with NVCC option '--Werror cross-execution-space-call' on Windows

Unified memory for device oversubscrption

Python wheel `libnvrtc-builtins.so`

Python Binary Distribution with multiple configurations

Self hosted runners on ITS VMs

Self hosted runners on ITS VMs