Yufeng Li
Do you have CUDA 11 installed? ORT doesn't bundle the CUDA runtime.
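For context, a quick way to check whether the CUDA provider is actually usable is to inspect the available providers and what a session falls back to. A minimal sketch, assuming a GPU build of onnxruntime and a placeholder `model.onnx` path:

```python
import onnxruntime as ort

# The GPU package links against a specific CUDA major version (CUDA 11 for
# the builds discussed here) but does not ship the runtime libraries; they
# must be installed on the system separately.
print(ort.get_available_providers())

# Request CUDA with a CPU fallback; "model.onnx" is a placeholder path.
sess = ort.InferenceSession(
    "model.onnx",
    providers=["CUDAExecutionProvider", "CPUExecutionProvider"],
)
# Shows which provider was actually activated; if only CPU appears, the
# CUDA/cuDNN libraries are likely missing from the loader path.
print(sess.get_providers())
```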
/azp run Windows GPU TensorRT CI Pipeline, onnxruntime-binary-size-checks-ci-pipeline, onnxruntime-python-checks-ci-pipeline, orttraining-linux-ci-pipeline, orttraining-linux-gpu-ci-pipeline, orttraining-ortmodule-distributed
/azp run Linux CPU CI Pipeline, Linux CPU Minimal Build E2E CI Pipeline, Linux GPU CI Pipeline, Linux GPU TensorRT CI Pipeline, Linux Nuphar CI Pipeline, Linux OpenVINO CI Pipeline,...
/azp run Windows CPU CI Pipeline,Windows GPU CI Pipeline
> ### Description > DynamicQuantizeLinear only supports uint8. This PR adds support for int8 and float8. > > ### Motivation and Context > The operator is used to...
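To make the limitation concrete, here is a minimal sketch of the stock operator today, which only produces uint8 output; the single-node graph and tensor names are illustrative, not from the PR:

```python
import numpy as np
import onnx
from onnx import TensorProto, helper
import onnxruntime as ort

# One-node model: DynamicQuantizeLinear computes a per-tensor scale and
# zero point from the input's dynamic range, then quantizes to uint8.
node = helper.make_node(
    "DynamicQuantizeLinear", inputs=["x"], outputs=["y", "y_scale", "y_zero_point"]
)
graph = helper.make_graph(
    [node],
    "dql_demo",
    inputs=[helper.make_tensor_value_info("x", TensorProto.FLOAT, [4])],
    outputs=[
        helper.make_tensor_value_info("y", TensorProto.UINT8, [4]),
        helper.make_tensor_value_info("y_scale", TensorProto.FLOAT, []),
        helper.make_tensor_value_info("y_zero_point", TensorProto.UINT8, []),
    ],
)
model = helper.make_model(graph, opset_imports=[helper.make_opsetid("", 13)])
onnx.checker.check_model(model)

sess = ort.InferenceSession(model.SerializeToString(), providers=["CPUExecutionProvider"])
y, scale, zero_point = sess.run(
    None, {"x": np.array([-1.0, 0.0, 1.5, 3.0], dtype=np.float32)}
)
print(y, scale, zero_point)  # quantized values, per-tensor scale, zero point
```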
What's the benefit of doing this? Q/DQ is very general functionality; presumably every framework that depends on ONNX already has an implementation of it.
@Johansmm, we currently only support per-channel quantization for weights, not activations. The reason is that it is not easy to make the underlying kernel fast while supporting per-channel for both weights and...
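A hedged illustration of that behavior through the public quantization API: with `onnxruntime.quantization`, `per_channel=True` gives each weight output channel its own scale and zero point, while activation scales stay per-tensor. The `RandomReader` class and file paths below are hypothetical placeholders:

```python
import numpy as np
from onnxruntime.quantization import CalibrationDataReader, QuantType, quantize_static

class RandomReader(CalibrationDataReader):
    """Hypothetical calibration reader feeding a few random batches."""
    def __init__(self, input_name="input", n=8):
        self.batches = iter(
            [{input_name: np.random.rand(1, 3, 224, 224).astype(np.float32)} for _ in range(n)]
        )

    def get_next(self):
        # Return the next feed dict, or None when calibration data is exhausted.
        return next(self.batches, None)

# per_channel=True applies to weight initializers only; activations keep a
# single per-tensor scale, matching the limitation described above.
quantize_static(
    "model_fp32.onnx",   # placeholder input path
    "model_int8.onnx",   # placeholder output path
    RandomReader(),
    per_channel=True,
    weight_type=QuantType.QInt8,
)
```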
/azp run ONNX Runtime Web CI Pipeline,Linux CPU CI Pipeline,Linux GPU CI Pipeline,orttraining-amd-gpu-ci-pipeline,Android CI Pipeline,iOS CI Pipeline,ONNX Runtime React Native CI Pipeline
/azp run Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,orttraining-amd-gpu-ci-pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,orttraining-ortmodule-distributed,onnxruntime-binary-size-checks-ci-pipeline,Big Models,Android CI Pipeline