gpu-rodinia

Compilation flags

Open AntonOellerer opened this issue 6 years ago • 1 comment

Hey, is there a particular reason why, for example, the kmeans_openmp benchmark is compiled and linked with -g and -O2, when the best performance would be expected without -g and with -O3? (See the Makefile.)

AntonOellerer, Nov 22 '19 17:11

Hi, sorry, I have never run these OpenMP benchmarks myself, but I'd say that -O3 is faster in general. That said, some algorithms and data structures may not benefit from the more aggressive optimization, and can even end up slower.

I suggest searching for known cases where -O3 is slower than -O2 and checking whether the same pattern appears in this benchmark.
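Measuring it directly is another way to settle the question. Below is a minimal sketch, assuming GCC and a benchmark that can be built from a plain list of source files. The helper names (`build_command`, `median_runtime`, `compare_flag_sets`) and the `bench_<label>` binary names are made up for illustration and are not part of gpu-rodinia.

```python
import statistics
import subprocess
import time


def build_command(sources, output, opt_flags, compiler="gcc"):
    """Return the compiler invocation for one flag set (nothing is executed)."""
    # -fopenmp enables OpenMP in GCC; -lm links the math library.
    return [compiler, *opt_flags, "-fopenmp", *sources, "-o", output, "-lm"]


def median_runtime(command, repeats=5):
    """Run `command` `repeats` times and return the median wall-clock seconds."""
    samples = []
    for _ in range(repeats):
        start = time.perf_counter()
        subprocess.run(command, check=True, stdout=subprocess.DEVNULL)
        samples.append(time.perf_counter() - start)
    return statistics.median(samples)


def compare_flag_sets(sources, run_args, flag_sets, repeats=5):
    """Build one binary per flag set and return {label: median seconds}."""
    results = {}
    for label, opt_flags in flag_sets.items():
        binary = f"./bench_{label}"  # hypothetical output name
        subprocess.run(build_command(sources, binary, opt_flags), check=True)
        results[label] = median_runtime([binary, *run_args], repeats)
    return results
```

Calling `compare_flag_sets(sources, run_args, {"g_O2": ["-g", "-O2"], "O3": ["-O3"]})` with the benchmark's own source files and command-line arguments gives one median time per flag set. The median is used rather than the mean so that a single noisy run does not skew the comparison.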



yuhc, Nov 22 '19 17:11