onnxruntime_backend

The Triton backend for the ONNX Runtime.

81 onnxruntime_backend issues, sorted by recently updated

**Description** I would like to be able to replace the `libonnxruntime.so` binary (as well as associated ones) without rebuilding the entire backend, for easier experimentation / testing / debugging. There...
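
For experimentation along those lines, a minimal sketch of a manual swap is below, assuming the backend keeps its ONNX Runtime libraries under `/opt/tritonserver/backends/onnxruntime/` inside the container; the paths and library filename are assumptions, not a documented interface.

```python
# Hypothetical helper: back up and replace the bundled libonnxruntime.so
# with a custom build for experimentation. Paths are assumptions.
import shutil
from pathlib import Path

BACKEND_DIR = Path("/opt/tritonserver/backends/onnxruntime")  # assumed backend install dir
CUSTOM_LIB = Path("/workspace/custom/libonnxruntime.so")      # your own ORT build

def swap_onnxruntime_lib() -> None:
    target = BACKEND_DIR / "libonnxruntime.so"
    backup = target.with_name(target.name + ".orig")
    if not backup.exists():
        shutil.copy2(target, backup)   # keep the original for rollback
    shutil.copy2(CUSTOM_LIB, target)   # drop in the custom build

if __name__ == "__main__":
    swap_onnxruntime_lib()
    print("Replaced libonnxruntime.so; restart tritonserver to pick it up.")
```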

bug

**Description** We have seen performance regressions for an ONNX model: - using the ORT backend - with `Loop` and `Memcpy` nodes (the latter is probably the most...
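
One way to see where the `Memcpy` nodes come from is to let ONNX Runtime write out the graph it actually executes on the CUDA execution provider and count the inserted `Memcpy*` nodes; a minimal sketch follows (the model paths and provider list are assumptions).

```python
# Sketch: dump the ORT-optimized graph for the CUDA EP and count the
# MemcpyToHost/MemcpyFromHost nodes inserted around CPU-only subgraphs
# such as Loop bodies. Model paths are assumptions.
import onnx
import onnxruntime as ort

MODEL = "model.onnx"
OPTIMIZED = "model.opt.onnx"

so = ort.SessionOptions()
so.optimized_model_filepath = OPTIMIZED  # ask ORT to save the graph it will run
ort.InferenceSession(MODEL, so, providers=["CUDAExecutionProvider", "CPUExecutionProvider"])

graph = onnx.load(OPTIMIZED).graph
memcpy = [n for n in graph.node if n.op_type.startswith("Memcpy")]
print(f"{len(memcpy)} Memcpy nodes:", [n.name for n in memcpy])
```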

bug

Is there a way to control (limit) the global GPU memory usage of the onnxruntime backend in Triton? The TensorFlow backend has the following CLI option: ``` --backend-config tensorflow,gpu-memory-fraction=X ``` I...
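
I don't know of a documented onnxruntime counterpart to that TensorFlow flag; the closest knob in ONNX Runtime itself is the CUDA execution provider's `gpu_mem_limit` arena cap, shown below in plain onnxruntime Python as an assumption about what one would want the backend to expose.

```python
# Sketch: ONNX Runtime's CUDA execution provider accepts a per-session
# arena cap (gpu_mem_limit, in bytes). This is plain onnxruntime, not a
# Triton --backend-config flag; the model path and limit are illustrative.
import onnxruntime as ort

providers = [
    ("CUDAExecutionProvider", {
        "gpu_mem_limit": 2 * 1024 ** 3,           # cap the arena at ~2 GiB
        "arena_extend_strategy": "kSameAsRequested",
    }),
    "CPUExecutionProvider",
]
sess = ort.InferenceSession("model.onnx", providers=providers)
```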

**Description** I exported the yolov7 detection model to ONNX using this code https://github.com/WongKinYiu/yolov7/blob/main/export.py and deployed it to Triton. It worked really well in the normal case, but when the model can't detect anything...
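
A minimal client-side sketch of the empty-result case is below, assuming the common yolov7 export tensor names (`images` in, `output` out), a 640x640 input, and a local Triton HTTP endpoint; those names, shapes, and the model name are assumptions, and the point is only that the detections tensor can legitimately come back with zero rows.

```python
# Sketch: query a yolov7 ONNX model on Triton and handle the
# "no detections" case where the output has zero rows.
# Tensor names, shape, and model name are assumptions.
import numpy as np
import tritonclient.http as httpclient

client = httpclient.InferenceServerClient(url="localhost:8000")

image = np.zeros((1, 3, 640, 640), dtype=np.float32)  # placeholder input
inp = httpclient.InferInput("images", list(image.shape), "FP32")
inp.set_data_from_numpy(image)

result = client.infer(model_name="yolov7", inputs=[inp])
detections = result.as_numpy("output")

if detections is None or detections.shape[0] == 0:
    print("no detections for this image")
else:
    print(f"{detections.shape[0]} detections")
```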

**Description** I've been trying various Hugging Face models on Triton using the ONNX Runtime backend. The models are first converted from Hugging Face to ONNX using one of the onnxruntime converters and then...
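
For reference, one illustrative export path (using `optimum` rather than whichever converter was used in the issue, so treat the whole snippet as an assumption) that produces a `model.onnx` laid out for a Triton model repository:

```python
# Sketch: export a Hugging Face model to ONNX with optimum and place it
# at model_repository/<name>/1/model.onnx for Triton. The model id and
# repository layout are illustrative assumptions.
from pathlib import Path
from optimum.onnxruntime import ORTModelForSequenceClassification

model_id = "distilbert-base-uncased-finetuned-sst-2-english"
version_dir = Path("model_repository/distilbert/1")
version_dir.mkdir(parents=True, exist_ok=True)

model = ORTModelForSequenceClassification.from_pretrained(model_id, export=True)
model.save_pretrained(version_dir)  # writes model.onnx (plus config) into the version dir
```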

Our ONNX models need onnxruntime version 1.10.0. I'm using Triton Server version 22.08, which ships onnxruntime version 1.11.1. How can I use the required version?

**Description** When attempting to launch a model converted to ONNX with `convert_sklearn`, the model fails to load with this error: ``` UNAVAILABLE: Internal: onnx runtime error 6: Exception during initialization:...
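
A common way to narrow down that kind of initialization failure is to check that plain onnxruntime can load and run the `convert_sklearn` output locally before handing it to Triton; a minimal sketch follows (the RandomForest estimator and feature count are illustrative assumptions).

```python
# Sketch: convert an sklearn model with convert_sklearn and verify that
# plain onnxruntime can load and run it before deploying to Triton.
# The estimator and feature count are illustrative assumptions.
import numpy as np
import onnxruntime as ort
from sklearn.ensemble import RandomForestClassifier
from skl2onnx import convert_sklearn
from skl2onnx.common.data_types import FloatTensorType

X = np.random.rand(100, 4).astype(np.float32)
y = (X[:, 0] > 0.5).astype(int)
clf = RandomForestClassifier(n_estimators=10).fit(X, y)

onx = convert_sklearn(clf, initial_types=[("input", FloatTensorType([None, 4]))])
with open("model.onnx", "wb") as f:
    f.write(onx.SerializeToString())

# If this load fails with "onnx runtime error 6", the problem is in the
# exported model itself rather than in the Triton backend.
sess = ort.InferenceSession("model.onnx", providers=["CPUExecutionProvider"])
print(sess.run(None, {"input": X[:2]}))
```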

Hey all, I have a quick question: is onnxruntime-genai ([https://onnxruntime.ai/docs/genai/api/python.html](https://onnxruntime.ai/docs/genai/api/python.html)) supported in Triton Inference Server's ONNX Runtime backend? I couldn't find anything relevant in the documentation. Thanks in advance!