ipex-llm
B60 Cannot Use FP16 Precision with GLM4-32B-0414
Describe the bug
The B60 cannot use FP16 precision with GLM4-32B-0414.
How to reproduce
Set the dtype to float32 and lowbit to fp16; the issue occurs.
Currently, neither XPU nor CUDA supports a float32 dtype when the weights are run as fp16.
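To make the constraint concrete, here is a minimal sketch of a pre-launch check that rejects the combination described above. The function name `check_precision_combo` and its parameters are hypothetical and are not part of ipex-llm or any other library; the only rule it encodes is the one stated in this thread (float32 dtype together with fp16 weights).

```python
def check_precision_combo(dtype: str, low_bit: str) -> None:
    """Hypothetical helper: reject the dtype/low-bit pair reported as unsupported.

    Only the float32 + fp16 combination from this issue is checked;
    every other pair is passed through without judgment.
    """
    # Normalize spelling so "FP16" and "Float32" are handled the same way.
    dtype_norm = dtype.strip().lower()
    low_bit_norm = low_bit.strip().lower()

    if dtype_norm in {"float32", "fp32"} and low_bit_norm == "fp16":
        raise ValueError(
            "dtype float32 with fp16 weights is not supported on XPU or CUDA"
        )


if __name__ == "__main__":
    # The configuration from the reproduction steps triggers the error.
    try:
        check_precision_combo("float32", "fp16")
    except ValueError as exc:
        print(f"rejected: {exc}")
```

A likely workaround, which is an assumption rather than something confirmed in this thread, is to set the dtype to float16 so that it matches the fp16 weights.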