Georg Hager
Georg Hager
Update: This seems to occur when the index increment is exactly 1, which happens when the compiler does neither vectorize nor otherwise unroll the loop: ``` vmovss %xmm0, (%rsi,%rdx,4) incq...
Still does not work reliably in 0.6.0: ``` Executing (compile): clang-5.0 -Ofast -mavx -D_POSIX_C_SOURCE=200112L -std=c99 himeno.c_compilable.c -S -I/home/gh/programming/python/lib/python3.6/site-packages/kerncraft/headers/ IACA analysis failed: pointer_increment could not be detected automatically. Use --pointer-increment to...
Here we are: [himeno.c_compilable.txt](https://github.com/RRZE-HPC/kerncraft/files/1547379/himeno.c_compilable.txt) Had to rename it - github does not allow .s files as attchments :-/ This was generated with clang 4.0 and the -O3 -mavx options. The...