Ryan McCormick
Ryan McCormick
Hi @Kellel, If you're just looking for the version of Triton that is currently running, is the server metadata endpoint sufficient? ```bash $ curl -s localhost:8000/v2 | jq { "name":...
Hi @yupbank, Thanks for the contribution! Have you filled out the CLA: https://github.com/triton-inference-server/server/blob/main/CONTRIBUTING.md#contributor-license-agreement-cla?
Hi @KhaledButainy, thanks for raising this issue. @matthewkotila could you comment on the valid combinations of parameters for PA here?
Hi @okyspace, thanks for filing this. Just to clarify, this is a request for a recommended RBAC configuration when using Triton in Kubernetes to use as an example/reference to go...
Hi @billcai, thanks for raising this request! CC @nskool
@GuanLuo @tanmayv25 @Tabrizian I believe there are some relaxed checks on the hot path for performance purposes. Do you have any comments on problem areas or risks?
Hi @HennerM, thanks for raising this. Do you mind adding the following: 1. Sharing a quick pytorch script to generate the `model.pt` for your example 2. Updating your example client...
Hi @ganeshmojow, Thanks for filing an issue. @nv-kmcgill53 could you help take a look here?
Hi @aviv12825, I see the errors returned involve "connection refused". Have you confirmed from the pod logs that the server itself started up successfully to expose these endpoints?
Hi @LLautenbacher, thanks for raising this issue with such detail. @Tabrizian @krishung5 may be able to chime in here. Is is possible this commented line is causing an extra copy?...