Gaetan RACIC
Hello there! Any news on this PR? I'm also hitting OOM errors because the inputs are too long. I would also like to be able to set the maximum length...
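(For context, a minimal sketch of the kind of cap being asked for, assuming a Hugging Face-style tokenizer; the model name and `max_length=512` are illustrative, not the PR's actual change:)

```python
from transformers import AutoTokenizer

# Illustrative only: truncate overly long inputs at tokenization time so
# encoding stays within a fixed budget instead of exhausting memory.
tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")

encoded = tokenizer(
    "some very long input text ...",
    truncation=True,   # drop tokens beyond max_length
    max_length=512,    # illustrative cap; tune to the available memory
)
```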
Hi @craigwalton, I'm facing the same issue as you. In the meantime, I've created my own NuGet package, which works, but it is not productized. It seems this project is...
Thanks for your answer 👍
Hello @msaroufim! Thanks for the new CI. I opened a PR a few months ago for the same feature request but did not manage to pass the CI. We are...
It is not clear to me why `testModelWithCustomPythonDependency` fails.
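(For readers following along: the test name points at TorchServe's per-model Python dependency support. Below is a minimal sketch of a handler relying on such a dependency; the `orjson` dependency and the class name are illustrative:)

```python
# Illustrative handler module for a model with a custom Python dependency.
# The dependency is listed in a requirements.txt passed to
# torch-model-archiver via --requirements-file, and the server must run
# with install_py_dep_per_model=true in config.properties.
import orjson  # illustrative third-party dependency

from ts.torch_handler.base_handler import BaseHandler


class JsonHandler(BaseHandler):
    """Minimal handler whose preprocess step uses the custom dependency."""

    def preprocess(self, data):
        # Decode each request body with the custom dependency.
        return [orjson.loads(row.get("body") or row.get("data")) for row in data]
```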
> @nateagr Thank you for the PR. From the customer's perspective, users only care about the overall inference latency. Internally, TS can have multiple phases/queues (e.g. in a workflow). I think your...
> @nateagr I suggest using "inferenceTimeout" to measure the overall end-to-end timeout. For example, if a user specifies "inferenceTimeout=30sec", an inference request will be terminated as soon as it exceeds...
Hi @lxning, I'm going to rephrase what you are suggesting just to make sure we are on the same page: we introduce a new parameter, "inferenceTimeout", that we use to terminate/abort prediction...
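(A minimal sketch of the semantics under discussion, assuming a single end-to-end deadline; the names `inference_timeout`, `run_inference`, and `predict_with_deadline` are illustrative, not TorchServe's actual implementation:)

```python
import concurrent.futures

INFERENCE_TIMEOUT = 30.0  # seconds; illustrative value

executor = concurrent.futures.ThreadPoolExecutor(max_workers=1)

def run_inference(request):
    ...  # placeholder for queueing + preprocessing + forward pass

def predict_with_deadline(request):
    # One deadline covers the whole request, regardless of which internal
    # phase or queue it is currently in.
    future = executor.submit(run_inference, request)
    try:
        return future.result(timeout=INFERENCE_TIMEOUT)
    except concurrent.futures.TimeoutError:
        future.cancel()  # best effort; an already-running task keeps going
        raise RuntimeError("inference aborted: exceeded inferenceTimeout")
```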