Nguyen Thanh Minh

Results 5 comments of Nguyen Thanh Minh

Hi @agunapal Can you answer me this question? If my model inferent 100 requests in 1 second with worker = 1, so if worker = 2 I can solve 100...

@agunapal I tried to send 1000 request for worker=1, batchsize = 16 for my_text_classifier and it throw status_cpde 503 it said: "Model \"my_tc\" has no worker to serve inference request....

After changing use_pure = True, timeout is working because [socket.settimeout(None)](https://docs.python.org/3/library/socket.html#socket.socket.settimeout) will not effect to SIGALRM. This is not an issue, this is a notification for anyone who encounters this issue...

Hi @agunapal, thanks for your answer. I meant that I am applying the following limits in Kubernetes: ```yaml resources: limits: cpu: '1' ``` However, my model is running with workers...