tensorrtllm_backend

Published 20 hours ago

triton-inference-server

Readme
Issues

What is the purpose of shm-region-prefix-name, and what are the prefix0_ files used for?

Open sugam-nexusflow opened this issue 9 months ago • 0 comments

Also, does the Docker shared memory size affect inference speed?

Jan 28 '25 21:01 sugam-nexusflow