Katherine Yang

Results 100 comments of Katherine Yang

Hello @rs-ixz can you share the exact model you're using for TRT and Tensorflow? This is so there's no confusion in reproducing your results. If `CUDA_VISIBLE_DEVICES=8`, that should mean only...

> @jbkyang-nvi the models are available in the archive I attached few responses ago (TRT_Slowness.zip) Thanks. Sorry I missed the zip file. How are you converting from tensorflow savedmodel to...

Thanks for your quick response! While I'm working on a reproducer, can you try creating the model with `--optShapes` flags to control the range of input shapes including batch...

@rs-ixz can you also list the GPUs you are using for measuring perf?

Hello, what does your model configuration file `config.pbtxt` look like? Also, Triton is up to 24.03 right now. Is there a reason why you are not using the latest version?

Hello, your image did not upload. Can you specify your model, which version of Triton you are using, etc.? Aka the original bug report template here: **Description** A clear and...

> Thanks for looking at this. > > Here is a full example that you can run with `pytest`. For demonstration purpose I use the identity function as model :`torch.nn.Identity()`...

Hello, while we try to reproduce your issue, can you update your client + server to Triton 24.03? 23.04 is 1 year old and we don't really maintain containers that...

`cuda-memory-pool-byte-size` is per GPU. As per the tritonserver CLI: > The total byte size that can be allocated as CUDA memory for the GPU device. If GPU support...

CLA is approved. @jasoncwik can you rebase as well?