Paul Richmond

Results 13 comments of Paul Richmond

Thanks for the reply @benbarsdell. This is certainly an issue for us, particularly when it comes to constant memory. We have a number of large constant and statically sized device...

@maddyscientist Yes this might work so long as you can link multiple kernels against the same module (containing the constant definition). Presumably this is fine as they are in the...

@benbarsdell Yes I imagine that you are right as after linking there would be multiple modules with duplicate definitions of the constant. To set the constant value would require doing...

@benbarsdell We have a work around for this for now but it would be a nice feature to enable instantiation of multiple kernels from the same module.

There is also an issue with assignment in a conditional statement. I have issued a PR (#63) for that but am not sure how to fix the issue above.

External benchmark use is evident at [QMUL](https://blog.hpc.qmul.ac.uk/benchmarking-grace-hopper-nodes/) and internally on the [RSE Blog](https://rse.shef.ac.uk/blog/2023-08-18-benchmarking-flamegpu2-on-h100-a100-and-v100-gpus/). This issue will be shared to capture any feedback on further benchmark ideas that could measure host...

Suggestion. Try a build of a simple example on Sheffield HPC system (Cent 7) to see if we can replicate this.

Colab has now moved to 11.2 ``` nvcc: NVIDIA (R) Cuda compiler driver Copyright (c) 2005-2021 NVIDIA Corporation Built on Sun_Feb_14_21:12:58_PST_2021 Cuda compilation tools, release 11.2, V11.2.152 Build cuda_11.2.r11.2/compiler.29618528_0 ```

An alternative to this is dynamic AWS instances. There is an action for this: https://github.com/machulav/ec2-github-runner Rather than run this from some private account the preferred method would be to run...

GitHub now has large runners with GPU backends for a (hefty) price. See: https://docs.github.com/en/actions/using-github-hosted-runners/about-larger-runners/managing-larger-runners