torchchat icon indicating copy to clipboard operation
torchchat copied to clipboard

[FEATURE REQUEST] Clang vectoriation on ARM: `warning: loop not vectorized`

Open mikekgfb opened this issue 10 months ago • 4 comments

(py311) mikekg@mikekg-mbp torchchat % python torchchat.py export --output-dso s.so  --quant '{"embedding": {"bitwidth":8, "groupsize": 32}}' --checkpoint-path ${MODEL_PATH} --temperature 0
Using device=cpu
Loading model...
Time to load model: 0.04 seconds
Quantizing the model with: {'embedding': {'bitwidth': 8, 'groupsize': 32}}
Time to quantize model: 0.05 seconds
Exporting model using AOT Inductor to /Users/mikekg/memory/x/z/a/b/torchchat/s.so
/Users/mikekg/memory/x/z/a/b/torchchat/cjks6zm6fxtuhqcxm7zrxesso4ksap62pjzfrfjhak7h5djxutyu.cpp:523:17: warning: loop not vectorized: the optimizer was unable to perform the requested transformation; the transformation might be disabled or specified as part of an unsupported transformation ordering [-Wpass-failed=transform-warning]
extern "C" void cpp_fused_index_put_stack_1(const float* in_ptr0,
                ^
/Users/mikekg/memory/x/z/a/b/torchchat/cjks6zm6fxtuhqcxm7zrxesso4ksap62pjzfrfjhak7h5djxutyu.cpp:1112:17: warning: loop not vectorized: the optimizer was unable to perform the requested transformation; the transformation might be disabled or specified as part of an unsupported transformation ordering [-Wpass-failed=transform-warning]
extern "C" void cpp_fused_index_put_stack_6(const float* in_ptr0,
                ^
/Users/mikekg/memory/x/z/a/b/torchchat/cjks6zm6fxtuhqcxm7zrxesso4ksap62pjzfrfjhak7h5djxutyu.cpp:1645:17: warning: loop not vectorized: the optimizer was unable to perform the requested transformation; the transformation might be disabled or specified as part of an unsupported transformation ordering [-Wpass-failed=transform-warning]
extern "C" void cpp_fused_index_put_stack_11(const float* in_ptr0,
                ^
/Users/mikekg/memory/x/z/a/b/torchchat/cjks6zm6fxtuhqcxm7zrxesso4ksap62pjzfrfjhak7h5djxutyu.cpp:2197:17: warning: loop not vectorized: the optimizer was unable to perform the requested transformation; the transformation might be disabled or specified as part of an unsupported transformation ordering [-Wpass-failed=transform-warning]
extern "C" void cpp_fused_index_put_stack_16(const float* in_ptr0,
                ^
/Users/mikekg/memory/x/z/a/b/torchchat/cjks6zm6fxtuhqcxm7zrxesso4ksap62pjzfrfjhak7h5djxutyu.cpp:2758:17: warning: loop not vectorized: the optimizer was unable to perform the requested transformation; the transformation might be disabled or specified as part of an unsupported transformation ordering [-Wpass-failed=transform-warning]
extern "C" void cpp_fused_index_put_stack_21(const float* in_ptr0,
                ^
/Users/mikekg/memory/x/z/a/b/torchchat/cjks6zm6fxtuhqcxm7zrxesso4ksap62pjzfrfjhak7h5djxutyu.cpp:3310:17: warning: loop not vectorized: the optimizer was unable to perform the requested transformation; the transformation might be disabled or specified as part of an unsupported transformation ordering [-Wpass-failed=transform-warning]
extern "C" void cpp_fused_index_put_stack_26(const float* in_ptr0,
                ^
6 warnings generated.
The generated DSO model can be found at: /Users/mikekg/memory/x/z/a/b/torchchat/s.so
(py311) mikekg@mikekg-mbp torchchat % 

cc: @manuelcandales @malfet @swolchok

mikekgfb avatar Apr 27 '24 04:04 mikekgfb