Indrajit Bhosale comments

Repositories
Issues
Comments

Results 14 comments of


                                            Indrajit Bhosale

TensorRT model low throughput

Hello @rs-ixz , Did the suggested flag resolve your issue? If yes we would like to close the issue Also regarding cpu pinned memory we have not seen any know...

TensorRT model low throughput

Hi @rs-ixz , I suspect the latency could be due to max_batch_size mismatch in models. Can you confirm both models have the same batch_size? You can check using curl localhost:8000/v2/models/"model...

Docs: Move Multi-Node documentation to Archives in favor of EKS-Mulit…

> Couldn't this move been seen as NVIDIA implicitly preferring EKS to other Kubernetes solutions? We have a GCP version in the works by with some google engineers

Failed to allocated memory for requested buffer of size X

Hello @aaditya-srivathsan, thanks for reaching out can you provide some more information? Can you try unloading some other models and then loading this model? It could very well be that...