Katherine Yang

100 comments of Katherine Yang

@damonmaria why are you manually calling `close()`? Also, just to make sure I understand, are you sharing the `InferenceServerClient` among your threads or do you have one client per thread?

@damonmaria sorry for the delay in responding. I meant to ask: where are you seeing the following?

> The issue is that this uses gevent which must always be called from...

@ivergara are you also calling `close()`? Can you share your client code so we can reproduce the issue?

Also, for future reference, as stated [here](https://github.com/triton-inference-server/client/blob/main/src/c%2B%2B/library/http_client.h#L90-L95):

> None of the methods of InferenceServerHttpClient are thread safe. The class is intended to be used by a single thread and simultaneously...
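
A minimal sketch of the one-client-per-thread pattern this implies (the `localhost:8000` address, the `my_model` model name, and the tensor names are placeholders, not taken from the original thread):

```python
import threading

import numpy as np
import tritonclient.http as httpclient

def worker(data: np.ndarray):
    # Each thread creates and owns its own client, since
    # InferenceServerHttpClient is not thread safe.
    client = httpclient.InferenceServerClient(url="localhost:8000")
    inp = httpclient.InferInput("INPUT__0", list(data.shape), "FP32")
    inp.set_data_from_numpy(data)
    result = client.infer(model_name="my_model", inputs=[inp])
    print(result.as_numpy("OUTPUT__0"))
    client.close()

threads = [
    threading.Thread(target=worker, args=(np.ones((1, 4), dtype=np.float32),))
    for _ in range(4)
]
for t in threads:
    t.start()
for t in threads:
    t.join()
```

Keeping each client thread-local should also sidestep the gevent constraint quoted earlier, since each client's event machinery is only ever touched by the thread that created it.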

@Davidleeeeee

> I solved this problem by declaring CLIENT inside the function, e.g.
>
> ```python
> def predict_batchsize(inputs, model_name='building', batchsize=64, inp_desc=("INPUT__0", "FP32"), otp_desc=("OUTPUT__0", "FP32")):
>     CLIENT = grpc_client.InferenceServerClient(url="192.168.128.29:8001")
>     ...
>     preds = CLIENT.infer(model_name=model_name, ...
> ```
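
For reference, a self-contained sketch of that per-call-client pattern with the gRPC client; the address, model name, and tensor descriptors are carried over from the quoted snippet, and the batch-splitting logic elided there is also omitted here:

```python
import numpy as np
import tritonclient.grpc as grpc_client

def predict(inputs: np.ndarray, model_name="building",
            inp_desc=("INPUT__0", "FP32"), otp_desc=("OUTPUT__0", "FP32")):
    # Creating the client inside the function means every call
    # (and therefore every thread) gets its own connection.
    client = grpc_client.InferenceServerClient(url="192.168.128.29:8001")
    infer_input = grpc_client.InferInput(inp_desc[0], list(inputs.shape), inp_desc[1])
    infer_input.set_data_from_numpy(inputs)
    result = client.infer(model_name=model_name, inputs=[infer_input])
    return result.as_numpy(otp_desc[0])
```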

Hi @narolski, sorry for the late response. I think it is true that the Triton client does not deallocate memory while a request has not yet completed. If that is the problem, we...

You can also read about this in the [architecture documentation](https://github.com/triton-inference-server/server/blob/main/docs/user_guide/architecture.md#concurrent-model-execution)
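
On the client side, one way to take advantage of concurrent model execution is to keep several requests in flight at once, for example with the gRPC client's `async_infer`. A sketch, assuming a server at `localhost:8001` and a hypothetical `my_model` with one FP32 input:

```python
from functools import partial
import queue

import numpy as np
import tritonclient.grpc as grpcclient

results = queue.Queue()

def callback(request_id, result, error):
    # Invoked by the client library when a request completes.
    results.put((request_id, result, error))

client = grpcclient.InferenceServerClient(url="localhost:8001")

# Issue all requests without waiting for each one to finish, so the
# server is free to schedule them onto multiple model instances.
num_requests = 8
for i in range(num_requests):
    data = np.random.rand(1, 4).astype(np.float32)
    inp = grpcclient.InferInput("INPUT__0", list(data.shape), "FP32")
    inp.set_data_from_numpy(data)
    client.async_infer(model_name="my_model", inputs=[inp],
                       callback=partial(callback, i))

# Collect the responses as they arrive.
for _ in range(num_requests):
    request_id, result, error = results.get()
    if error is not None:
        print(f"request {request_id} failed: {error}")
    else:
        print(f"request {request_id}:", result.as_numpy("OUTPUT__0"))

client.close()
```

Whether these requests actually execute concurrently depends on the model's instance configuration described in that document.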

@wdongdongde can you provide a small reproducible example for the client?

Closing this ticket due to inactivity. @wdongdongde, please reopen with more information if you would like us to look into it.

It looks like the name of the model is `${MODEL_NAME}`. Is that correct?