Kris Hung comments

Results 101 comments of


                                            Kris Hung

Failed when building for Windows 10

Closing issue due to lack of activity. Please re-open the issue if you would like to follow up with this.

run testing triton fail

Closing issue due to lack of activity. Please re-open the issue if you would like to follow up with this.

Dynamically loaded models don't work with ensemble

Hi @fran6co, could you also provide the full command you are using to run tritonserver? @GuanLuo Do you see anything which could help here?

Build latest triton custom image for Ubuntu 18.04

Closing issue due to lack of activity. Please re-open the issue if you would like to follow up with this issue.

Support for vLLM and TRT-LLM running in OpenAI compatible mode

Thanks for submitting feature request! CC @nnshah1 on the request for starting server with the OpenAI compatible API. For the client side, we have introduced the [generate endpoint](https://github.com/triton-inference-server/server/blob/main/docs/protocol/extension_generate.md). which is...

Tritonserver Physical RAM Grow Overtime

Hi @apokerce, thanks for the repro steps, we will be looking into the memory issue. Meanwhile, could you also provide the full output from Valgrind? In our CI testing we...

Tritonserver Physical RAM Grow Overtime

@apokerce Thanks for providing the file. Can you also let me know what kind of hardware like GPU/device/framework that you are using? I wasn't able to repro the OOM issue...

Tritonserver Physical RAM Grow Overtime

Hi @apokerce, I ran the reproducer on A40 but still couldn't observe any memory growth. I used the following script to run perf_analyzer for several iterations and obtained memory usage...

Tritonserver Physical RAM Grow Overtime

Hi @apokerce, thank you for providing the files. I reran the test, and the results are basically the same. The memory usage for GRPC remains unchanged just like the above...

Tritonserver Physical RAM Grow Overtime

Thanks for the info, @apokerce ! I'm running the experiment with the command you provided to see if I could see the same. Bug fixes are included in newer version...