Kshitij Srivastava
Kshitij Srivastava
You are running your model and the input tensor on a cpu. it should be on a gpu
TRT engine is a black box. Netron wont work with a trt engine
You are explicitly copying tensors from gpu to cpu, hence , the cost of transfer
I mean if you really want to copy tensor from gpu to cpu, then you will have to incur some transfer time
Tried this example with PR #691 and there is no error.