server icon indicating copy to clipboard operation
server copied to clipboard

triton gpu deploy suddenly become very slow from 0.03s to 12s, how to solve it ?

Open yiluzhuimeng opened this issue 5 months ago • 1 comments

Description A clear and concise description of what the bug is.

Triton Information What version of Triton are you using?

Are you using the Triton container or did you build it yourself?

To Reproduce Steps to reproduce the behavior.

Describe the models (framework, inputs, outputs), ideally include the model configuration file (if using an ensemble include the model configuration file for that as well).

Expected behavior A clear and concise description of what you expected to happen.

yiluzhuimeng avatar Sep 20 '24 12:09 yiluzhuimeng