engine
Introduce the -fpeel-loops flag for GCC builds.
This speeds up scalar-optimized code by a factor of 1.5 and fixes #433.
Should we check whether GCC supports that option? Or is it harmless to enable it even if it is not supported?
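One way to answer the question empirically is to probe the compiler before adding the flag. GCC rejects -f options it does not recognize with an error, so a trial compile of an empty program shows whether the flag is accepted. The sketch below is illustrative only: the helper name compiler_accepts_flag is hypothetical and not part of this project, and the project's actual build system may already provide its own flag check.

```python
import os
import shutil
import subprocess
import tempfile


def compiler_accepts_flag(compiler: str, flag: str) -> bool:
    """Return True if `compiler` compiles a trivial C file with `flag`.

    Hypothetical helper for illustration; not part of the project.
    """
    # If the compiler is not on PATH, it cannot accept the flag.
    if shutil.which(compiler) is None:
        return False

    with tempfile.TemporaryDirectory() as tmp:
        src = os.path.join(tmp, "probe.c")
        obj = os.path.join(tmp, "probe.o")
        with open(src, "w") as f:
            f.write("int main(void) { return 0; }\n")

        # Compile only (-c); a non-zero exit code means the flag
        # (or the compile itself) was rejected.
        result = subprocess.run(
            [compiler, flag, "-c", src, "-o", obj],
            stdout=subprocess.DEVNULL,
            stderr=subprocess.DEVNULL,
        )
        return result.returncode == 0


if __name__ == "__main__":
    flag = "-fpeel-loops"
    if compiler_accepts_flag("gcc", flag):
        print(f"gcc accepts {flag}")
    else:
        print(f"gcc does not accept {flag} (or gcc is not installed)")
```

If the probe fails, the build could simply omit the flag rather than fail outright.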