modelmesh-runtime-adapter
modelmesh-runtime-adapter copied to clipboard
feat: Add ModelStreamInfer to Triton MethodInfos
Motivation
Fix #80
Modifications
Add ModelStreamInfer to triton MethodInfos
Result
inference.GRPCInferenceService/ModelStreamInfer gRPC requests can be send to triton, which enable the use of triton backends and models which require streaming.
[APPROVALNOTIFIER] This PR is NOT APPROVED
This pull-request has been approved by: Legion2
Once this PR has been reviewed and has the lgtm label, please assign ckadner for approval by writing /assign @ckadner in a comment. For more information see:The Kubernetes Code Review Process.
The full list of commands accepted by this bot can be found here.
Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment