ikarlin

Results 8 comments of ikarlin

@koparasy do you always see the same cell as the problem or does that change from time to time. Some of the CUDA versions have an unidentified race condition. I...

@koparasy thanks. I have confirmed with Nvidia this is the known race condition. We are discussing the best way to get the fix into the code. Do you have a...

There probably is a bug in their code for testing processors with core counts of 8 or higher looking at the results. I'm not sure how the {NUM_CPU_PHYSICAL_CORES_CUBE and other...

@nmhamster are you using master? If so I'm not sure how this output makes sense since the overall number and the elapsed time are the same value being printed out...

@nmhamster these are fixed in master. Do you need a tagged release? If so I think the easiest solution would be for me to tag a new release since there...

@nmhamster after some thought I think a new tagged release is overdue. I need to do a bit of performance testing to confirm no significant regression, but otherwise the code...

Yes the CUDA version is not maintained. It was a Nvidia port. and Nvidia has not been updating it. The mainline code is maintained.

@ItayGarin this looks fine, but I have a question out to a local OpenMP expert about this before I approve. Mostly its a rant about how OpenMP does things that...