less_slow.cpp icon indicating copy to clipboard operation
less_slow.cpp copied to clipboard

Data Alignment may have error?

Open bfdyanshe opened this issue 1 year ago • 0 comments

The loop in f32_pairwise_accumulation have f32s_in_cache_line_half_k * 2 times, and the other one only have f32s_in_cache_line_half_k times. 图片

bfdyanshe avatar Jul 18 '24 06:07 bfdyanshe