kernel_tuner

Improved OpenACC support

Open isazi opened this issue 10 months ago • 2 comments

Improves support for tuning OpenACC code. Fixes multiple bugs and adds new functionality, including automatic data transfers to improve performance measurements.

isazi · Mar 29 '24 13:03

@benvanwerkhoven this pull request is ready to merge; it would be nice if we could merge it before CompSys. Two more features, as well as OpenMP support, are planned for later this year, but I don't think it is helpful to wait for them before merging this.

isazi · May 13 '24 14:05

Awesome work! Thanks a lot! I just ran the tests and examples on my laptop and it worked like a charm.

I noticed that all the old C and Fortran examples that use either OpenACC or OpenMP still work, which is good. We should think about whether to keep those, or to make the new way the recommended way to tune directive-based code and provide examples for just that. If you have an opinion on how to organize the examples, I'm happy to hear it!

I also noticed that the documentation barely mentions OpenACC support. I've added the examples to the list of languages that implement the Vector Add example, but we should expand the documentation in the future.

benvanwerkhoven · May 23 '24 14:05