Alfio Lazzaro comments

Results 115 comments of


                                            Alfio Lazzaro

Hit acc_devmem_setzero: failed in dbcsr/acc/dbcsr_acc_devmem.F:445

Given the fact that it is suggested to set the GPU even per thread (my understanding was that each thread should keep the same GPU of the rank, it turns...

Hit acc_devmem_setzero: failed in dbcsr/acc/dbcsr_acc_devmem.F:445

> @alazzaro > Is there any plan of DBCSR to support the Multi-GPU in one node in the neer future? @vitesselin Assuming now other issues && Assuming I can run...

Hit acc_devmem_setzero: failed in dbcsr/acc/dbcsr_acc_devmem.F:445

@hfp Yes, this is the idea... I must say, we don't need to save per stream, we need to save per rank and apply the setdevice any time we call...

Hit acc_devmem_setzero: failed in dbcsr/acc/dbcsr_acc_devmem.F:445

Still, each stream within a rank cannot only access to a single device, so all stream will have the same value... Does the ACC implementation allow to have streams with...

Hit acc_devmem_setzero: failed in dbcsr/acc/dbcsr_acc_devmem.F:445

Concerning the streams, what you did (@oschuett ) is still OK. NIVIDA didn't change that much since then... There are still 2 copy engines/compute with PASCAL, probably you have more...

Hit acc_devmem_setzero: failed in dbcsr/acc/dbcsr_acc_devmem.F:445

@oschuett I'm far to be an expert, I'm just reporting what it is written here https://devblogs.nvidia.com/cuda-pro-tip-always-set-current-device-avoid-multithreading-bugs/ Quoting the bottom line: `To save yourself from a variety of multithreading bugs, remember:...

Alfio Lazzaro

Hit acc_devmem_setzero: failed in dbcsr/acc/dbcsr_acc_devmem.F:445

Hit acc_devmem_setzero: failed in dbcsr/acc/dbcsr_acc_devmem.F:445

Hit acc_devmem_setzero: failed in dbcsr/acc/dbcsr_acc_devmem.F:445

Hit acc_devmem_setzero: failed in dbcsr/acc/dbcsr_acc_devmem.F:445

Hit acc_devmem_setzero: failed in dbcsr/acc/dbcsr_acc_devmem.F:445

Hit acc_devmem_setzero: failed in dbcsr/acc/dbcsr_acc_devmem.F:445

Hit acc_devmem_setzero: failed in dbcsr/acc/dbcsr_acc_devmem.F:445

DBCSR tensor batching

dbcsr_tensor_unittest fails with CCE 10

ACC init/finalize issues