Ryan McCormick
Hi @OvervCW, for simplicity (since we didn't receive a CLA and recently had some major changes around the structure of `main.cc`), I've filed a similar PR here to add this...
My apologies @OvervCW, we do appreciate the contribution nonetheless! For future PRs, please do let us know when a CLA has already been submitted, as it helps the team verify more...
Hi @ClifHouck, thanks for this contribution! While you have figured out a way to have the existing logic propagate the GPU labels to the generic per-model inference metrics, I...
> (1) Clearly MetricModelReporter expected metrics to be decisively enabled or disabled by the time that InferenceServer::Init is called.

I think that's a reasonable thing to expect. `lserver->Init()` initializes most...
Thanks for the submission, @HennerM! We're looking into this PR and the underlying root causes and edge cases throughout the Sequence Batch Scheduler.
CC @GuanLuo @whoisj
Hi @okyspace, we have the restricted endpoint feature for both HTTP and GRPC endpoints: https://github.com/triton-inference-server/server/blob/main/docs/customization_guide/inference_protocols.md#limit-endpoint-access-beta. You should be able to set up key/value pairs to authorize specific routes/features. Does this satisfy...
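For illustration, here is a minimal client-side sketch of how the restricted endpoint feature is typically exercised, assuming the server was started with a restriction along the lines of `--http-restricted-api=model-repository:admin-key=admin-value` (the `admin-key`/`admin-value` pair here is a hypothetical example, not a prescribed name):

```python
# Minimal sketch: calling a restricted HTTP API group with the configured
# key/value pair passed as a request header.
#
# Assumes tritonserver was started with something like:
#   tritonserver --model-repository=/models \
#       --http-restricted-api=model-repository:admin-key=admin-value
#
# The header name/value ("admin-key"/"admin-value") are hypothetical examples.
import tritonclient.http as httpclient

client = httpclient.InferenceServerClient(url="localhost:8000")

# Without the matching header this call should be rejected by the server;
# with it, the restricted model-repository API group is authorized.
index = client.get_model_repository_index(headers={"admin-key": "admin-value"})
print(index)
```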
Hi @okyspace, I don't believe it is currently possible to restrict access to specific models. The workaround would likely be to start a separate `tritonserver` instance for each logical set of models you would...
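As a sketch of that workaround, something like the following launches one server per model group on distinct ports, so access can be segmented per group (the repository paths, group names, and port numbers are hypothetical):

```python
# Minimal sketch of the "separate tritonserver per model group" workaround.
# Each instance serves its own model repository on its own set of ports.
import subprocess

model_groups = {
    # group name -> (model repository path, http port, grpc port, metrics port)
    "team-a": ("/models/team_a", 8000, 8001, 8002),
    "team-b": ("/models/team_b", 9000, 9001, 9002),
}

servers = []
for name, (repo, http_port, grpc_port, metrics_port) in model_groups.items():
    servers.append(subprocess.Popen([
        "tritonserver",
        f"--model-repository={repo}",
        f"--http-port={http_port}",
        f"--grpc-port={grpc_port}",
        f"--metrics-port={metrics_port}",
    ]))

# Block until the servers exit (e.g. on shutdown).
for proc in servers:
    proc.wait()
```

Network-level controls (or the restricted endpoint feature mentioned above) can then be applied per instance rather than per model.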
Looks like this is also a duplicate of https://github.com/triton-inference-server/server/pull/6099
@is did you fill out a CLA as described here: https://github.com/triton-inference-server/server/blob/main/CONTRIBUTING.md#contributor-license-agreement-cla?