QPyTorch
float_quantize produces wrong output on multi-GPU setups
When I run a model after applying float_quantize to the weights or activations on a multi-GPU setup (a Hugging Face OPT model loaded with device_map='auto'), quantization of the layers placed on the second or later GPU goes wrong: the quantized output is mostly zeros.