Konstantinos Parasyris issues

Results 9 issues of


                                            Konstantinos Parasyris

illegal memory access was encountered

Hello, I am compiling gpusph with "make" then I execute " make ProblemExample". And then I execute ./GPUSPH and I get the following error: `Device 0 thread 140735808663952 iteration 0...

Volume error When running with cuda and mpi

I am running Lulesh on a single node with 160 cpus and 4 (Tesla V100-SXM2) gpus. I am using openmpi-3.0.0 with cuda cuda 9.1. I execute the following command: mpirun...

[Draft] Removed redundant copies and optimized data transfers

Pre-allocates all data on the device and uses respective pointers when calling the kernels. Currently does not de-allocate at the end of the execution.

Extend the py-torch package to explicitly specify the location of FP16

Add a simple patch to the `cmake` file of torch. Torch when using `USE_SYSTEM_FP16` variable assumes the FP16 headers to exist under a hard code path (`/usr/include/fp16.h`). We patch the...

python

patch

update-package

Check for the existence of CUDNN_FRONTEND_PATH before looking in default directories. If it exists, skip additional checks.

Check for the existence of `CUDNN_FRONTEND_PATH` before looking in default directories. If it exists, skip additional checks. Further, add include and library paths based on `CUDNN_PATH` variable.

Konstantinos Parasyris

illegal memory access was encountered

Volume error When running with cuda and mpi

[Draft] Removed redundant copies and optimized data transfers

Extend the py-torch package to explicitly specify the location of FP16

Check for the existence of CUDNN_FRONTEND_PATH before looking in default directories. If it exists, skip additional checks.

Optionally specify root location of CUDNN

[CIR][HIP|CUDA] Mirrors CUDARuntime skeleton of OG

HIP/AMDGPU support

[CIR][HIP] Lower Device CIR to LLVM IR