onnxruntime
ppc64le: fix MlasQLinearMulKernel's VSX code to work with inputs of 32 bits
I hit a bug in this VSX code when both InputA and InputB are 32 bits. This patch fixes the issue without losing performance.
@yufenglee Requesting review.
/azp run Windows GPU TensorRT CI Pipeline, onnxruntime-binary-size-checks-ci-pipeline, onnxruntime-python-checks-ci-pipeline, orttraining-linux-ci-pipeline, orttraining-linux-gpu-ci-pipeline, orttraining-ortmodule-distributed
/azp run Linux CPU CI Pipeline, Linux CPU Minimal Build E2E CI Pipeline, Linux GPU CI Pipeline, Linux GPU TensorRT CI Pipeline, Linux Nuphar CI Pipeline, Linux OpenVINO CI Pipeline, MacOS CI Pipeline, ONNX Runtime Web CI Pipeline, onnxruntime-binary-size-checks-ci-pipeline
Azure Pipelines successfully started running 6 pipeline(s).
Azure Pipelines successfully started running 9 pipeline(s).
/azp run Windows CPU CI Pipeline,Windows GPU CI Pipeline
Azure Pipelines successfully started running 2 pipeline(s).