Kris Hung
Kris Hung
Closing due to lack of activity. Please re-open the issue if you would like to follow up with this issue.
Looks duplicated to this issue in TRT-LLM repo: https://github.com/NVIDIA/TensorRT-LLM/issues/2443/
I think the `TRITON_MODEL_DIRECTORY` is only supported in Triton Python backend at the moment. It is up to the backend to be able to dynamically set the path when using...
Closing due to lack of activity. Please re-open the issue if you would like to follow up with this issue.
Hi @Joenhle, thanks for fling the issue. I was wondering which version of Triton do you use? Can you please provide a minimum reproducer for us to investigate? The order...
Closing due to lack of activity. Please re-open the issue if you would like to follow up with this issue.
Hi @JamesBowerXanda, Triton doesn't officially support Mac, but I assume it would work if you are only running CPU-only model. I couldn't reproduce the hang using a linux machine. Since...
Closing due to lack of activity. Please re-open the issue if you would like to follow up with this issue.
Hi @yaysummeriscoming, the TRT-LLM container has different dependency stack. Some packages like pytorch are required during runtime so the image size is larger than the other Triton images. We are...