Adrian Tobiszewski
Adrian Tobiszewski
ONNX backend: https://github.com/triton-inference-server/onnxruntime_backend/pull/234
**Description** Triton is unable to load models with Tensorflow saved model format with OpenVINO backend. **Triton Information** What version of Triton are you using? 23.10,23.11,23.12,24.03,24.04 don't work. Are you using...
### ๐ Summary * Upb patch + compilation fixes on ubi9 * Fix hadolint * Remove dockerfile label version It is not used at all TODO: * CUDA build -...
### ๐ Summary Describe the changes. ID:CVS-141578 ### ๐งช Checklist - [ ] Unit tests added. - [ ] The documentation updated. - [ ] Change follows security best practices....
* Enable usage of OpenCL buffers via OVMS C-API * Enable usage of VA surfaces via OVMS C-API * Enable setting output buffers for C-API inference to avoid copy *...
### ๐ Summary JIRA/Issue if applicable. Describe the changes. ### ๐งช Checklist - [ ] Unit tests added. - [ ] The documentation updated. - [ ] Change follows security...
### ๐ Summary JIRA/Issue if applicable. Describe the changes. ### ๐งช Checklist - [ ] Unit tests added. - [ ] The documentation updated. - [ ] Change follows security...