tensorrtllm_backend
tensorrtllm_backend copied to clipboard
Support python runtime
Triton inference server supports C++ runtime for Tensorrtllm.
But would be great to support also Python runtime