DeepSpeedExamples
DeepSpeedExamples copied to clipboard
Wrong import in inference quantization example
Hi,
At https://github.com/microsoft/DeepSpeedExamples/blob/master/inference/huggingface/zero_inference/README.md , the referenced import from deepspeed.compression.inference.quantization import _init_group_wise_weight_quantization
is wrong.
The correct one is from deepspeed.inference.quantization import _init_group_wise_weight_quantization
.
Can you please correct it? Best regards, Epliz