ikarlin
ikarlin
@koparasy do you always see the same cell as the problem or does that change from time to time. Some of the CUDA versions have an unidentified race condition. I...
@koparasy thanks. I have confirmed with Nvidia this is the known race condition. We are discussing the best way to get the fix into the code. Do you have a...
There probably is a bug in their code for testing processors with core counts of 8 or higher looking at the results. I'm not sure how the {NUM_CPU_PHYSICAL_CORES_CUBE and other...
@nmhamster are you using master? If so I'm not sure how this output makes sense since the overall number and the elapsed time are the same value being printed out...
@nmhamster these are fixed in master. Do you need a tagged release? If so I think the easiest solution would be for me to tag a new release since there...
@nmhamster after some thought I think a new tagged release is overdue. I need to do a bit of performance testing to confirm no significant regression, but otherwise the code...
Yes the CUDA version is not maintained. It was a Nvidia port. and Nvidia has not been updating it. The mainline code is maintained.
@ItayGarin this looks fine, but I have a question out to a local OpenMP expert about this before I approve. Mostly its a rant about how OpenMP does things that...