Memory Leak When Using ONNXRuntime With OpenVino EP
Description
Using the same model as in #102, the Triton Inference Server has a memory leak, as observed by `docker stats`, after adding the following to the model config (the `execution_accelerators` block belongs inside `optimization`):

```
optimization {
  execution_accelerators {
    cpu_execution_accelerator : [ {
      name : "openvino"
    } ]
  }
}
```
Without the openvino EP, there is no memory leak.
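Since the leak shows up in `docker stats`, one way to quantify the growth is to log the container's memory usage over time. Below is a minimal sketch, assuming a hypothetical container name "triton"; substitute the real container name.

```python
# Log a Triton container's memory usage every 10 seconds via `docker stats`.
# The container name "triton" is a placeholder, not from the original report.
import subprocess
import time

CONTAINER = "triton"

while True:
    # --no-stream takes a single sample; --format keeps only the memory column.
    result = subprocess.run(
        ["docker", "stats", "--no-stream", "--format", "{{.MemUsage}}", CONTAINER],
        capture_output=True, text=True, check=True,
    )
    print(time.strftime("%H:%M:%S"), result.stdout.strip())
    time.sleep(10)
```

Under sustained load the logged usage should plateau; with the leak described here it keeps climbing.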
Triton Information
What version of Triton are you using?
openvino==2022.1.0 with triton-onnxbackend==22.06 and onnxruntime==1.11.1.
Are you using the Triton container or did you build it yourself?
Custom container build.
To Reproduce
See #102 for the model; a reproduction sketch follows.
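To drive the server while watching memory, a load-generation script along these lines can be used. The model name "model_102", the input name "input", the shape, and the FP32 datatype are all placeholders; the real values come from the model attached to #102.

```python
# Send inference requests in a loop to reproduce the memory growth.
# All model-specific values below are hypothetical placeholders.
import numpy as np
import tritonclient.http as httpclient

client = httpclient.InferenceServerClient(url="localhost:8000")

# Placeholder shape/dtype; use the shapes from the model's config.pbtxt.
data = np.random.rand(1, 3, 224, 224).astype(np.float32)

inp = httpclient.InferInput("input", list(data.shape), "FP32")
inp.set_data_from_numpy(data)

for i in range(10_000):
    client.infer("model_102", inputs=[inp])
    if i % 500 == 0:
        print(f"sent {i} requests")  # watch `docker stats` in parallel
```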
Expected behavior
Provision of model configuration flags (like in #102) that customize the memory handling of the OpenVino EP.
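For context, the onnxruntime backend already exposes memory-related parameters in config.pbtxt, such as arena shrinkage, which is presumably the kind of flag #102 refers to; whether anything equivalent takes effect on the OpenVino EP path is the open question here:

```
parameters { key: "memory.enable_memory_arena_shrinkage" value: { string_value: "cpu:0" } }
```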
After further investigation of this issue, I've determined that a memory reuse mechanism was implemented for the OpenVino EP: https://github.com/openvinotoolkit/openvino/pull/11667.
I will try building the OpenVino master branch with the changes from the above PR to see whether it resolves this issue.
Update:
- Building OpenVino with the changes from https://github.com/openvinotoolkit/openvino/pull/11667 did not solve the issue for my model.
- I've also reported the bug to the OpenVino team: https://github.com/openvinotoolkit/openvino/issues/12307
There is another PR that addresses a growing RNN cache issue; it could help to try it: https://github.com/openvinotoolkit/openvino/pull/12053
I have the same problem with the CRAFT model, even after converting CRAFT to the OpenVino IR format. Is this going to be fixed?
You can try OpenVino 2022.2 or the latest master branch.
@narolski Has this problem been solved?