Konstantinos Parasyris
Hello, I am compiling GPUSPH with "make", and then I run "make ProblemExample". When I then execute ./GPUSPH, I get the following error: `Device 0 thread 140735808663952 iteration 0...
I am running LULESH on a single node with 160 CPUs and 4 GPUs (Tesla V100-SXM2). I am using openmpi-3.0.0 with CUDA 9.1. I execute the following command: mpirun...
Pre-allocates all data on the device and passes the corresponding device pointers when calling the kernels. Currently, the data is not de-allocated at the end of execution.
Add a simple patch to the `cmake` file of torch. When the `USE_SYSTEM_FP16` variable is set, Torch assumes the FP16 headers exist under a hard-coded path (`/usr/include/fp16.h`). We patch the...
Check for the existence of `CUDNN_FRONTEND_PATH` before looking in default directories. If it exists, skip additional checks. Further, add include and library paths based on the `CUDNN_PATH` variable.
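The lookup order described above can be sketched in Python for illustration. This is not the makefile logic from the PR itself. The helper name `resolve_cudnn_paths`, the `default_dirs` argument, and the `include`/`lib` subdirectory names are assumptions made for the example, as is the reading that "existence" means the variable is set and points to an existing directory.

```python
import os


def resolve_cudnn_paths(env, default_dirs, exists=os.path.isdir):
    """Hypothetical helper: return (frontend_dir, include_dirs, lib_dirs)."""
    # Prefer CUDNN_FRONTEND_PATH when it is set and points to a directory.
    frontend = env.get("CUDNN_FRONTEND_PATH")
    if not (frontend and exists(frontend)):
        # Otherwise fall back to the first default directory that exists.
        frontend = next((d for d in default_dirs if exists(d)), None)

    include_dirs, lib_dirs = [], []
    cudnn = env.get("CUDNN_PATH")
    if cudnn:
        # Assumed layout: headers under include/, libraries under lib/.
        include_dirs.append(os.path.join(cudnn, "include"))
        lib_dirs.append(os.path.join(cudnn, "lib"))
    return frontend, include_dirs, lib_dirs
```

Calling `resolve_cudnn_paths(os.environ, ["/usr/include"])` would apply the same precedence to the real environment; the default directory shown here is only a placeholder.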
Defines the location of CUDNN for the makefile. This is the second part of the original #628, as requested by @rosslwheeler.
As discussed in #1341, CIRGen does not match the original CodeGen (OG). This PR mirrors the skeleton of OG.
Hi, do you plan to enable HIP support, similar to CUDA?