Katherine Yang

100 comments by Katherine Yang

Closing this issue due to lack of activity. If this issue needs follow-up, please let us know and we can reopen it for you.

> > Can't I use an existing Docker file with dependencies listed for Triton?
>
> The problem is that the Dockerfile for the `min`...

Also, NVIDIA offers an official support path for its products that could help with this issue: https://www.nvidia.com/en-us/data-center/products/ai-enterprise/#benefits

Hi, we're still working with GKE on this issue.

Closing this issue due to lack of activity. If this issue needs follow-up, please let us know and we can reopen it for you.

@zhaozhiming37 you can read about how to use CUDA shared memory here: https://github.com/triton-inference-server/client#cuda-shared-memory and https://github.com/triton-inference-server/client#download-docker-image-from-ngc

You can find an example here: https://github.com/triton-inference-server/client/blob/main/src/python/examples/simple_http_shm_client.py

When you are sending new requests you set...
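As a rough illustration of the flow in the linked client example, here is a minimal sketch of registering a CUDA shared-memory region and pointing a request's input at it. The model name (`example_model`), input name (`INPUT0`), region name, shapes, and URL are illustrative assumptions, not from the comments above; the `tritonclient` calls follow the library's `cuda_shared_memory` utilities and require a GPU plus a running Triton server.

```python
# Hedged sketch of the CUDA shared-memory request flow, modeled on the
# simple_http_shm_client.py example linked above. Model/input/region names
# and shapes are hypothetical placeholders.
import numpy as np


def make_input(shape=(16,), dtype=np.float32) -> np.ndarray:
    """Prepare a request payload; its byte size sizes the shm region."""
    return np.arange(np.prod(shape), dtype=dtype).reshape(shape)


def send_via_cuda_shm(data: np.ndarray, url: str = "localhost:8000") -> None:
    # Imports kept local so the sketch is readable without a GPU/Triton install.
    import tritonclient.http as httpclient
    import tritonclient.utils.cuda_shared_memory as cudashm

    byte_size = data.nbytes
    client = httpclient.InferenceServerClient(url=url)

    # 1. Create a CUDA shared-memory region on device 0 and copy the input in.
    shm_handle = cudashm.create_shared_memory_region("input_region", byte_size, 0)
    cudashm.set_shared_memory_region(shm_handle, [data])

    # 2. Register the region with the server once; reuse it for new requests.
    client.register_cuda_shared_memory(
        "input_region", cudashm.get_raw_handle(shm_handle), 0, byte_size
    )

    # 3. For each new request, point the input tensor at the registered region.
    infer_input = httpclient.InferInput("INPUT0", list(data.shape), "FP32")
    infer_input.set_shared_memory("input_region", byte_size)
    client.infer("example_model", inputs=[infer_input])

    # 4. Clean up when done.
    client.unregister_cuda_shared_memory("input_region")
    cudashm.destroy_shared_memory_region(shm_handle)
```

The key point for new requests is step 3: once the region is registered, each request just references it by name and byte size instead of copying the tensor over HTTP.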

^ @kthui any idea why this might be happening?

Hello. Looking into this now. As an FYI, this will be added to the 24.05 release instead of 24.04 since it was not prioritized for this release.

Merging together with https://github.com/triton-inference-server/client/pull/465 once pre-commit passes; tested with https://github.com/triton-inference-server/server/pull/7123.

Thanks for your change. Can you sign the [CLA](https://github.com/triton-inference-server/server/blob/main/CONTRIBUTING.md#contributor-license-agreement-cla)? Otherwise LGTM.