Zero Zeng
Can you try exporting the PyTorch model to ONNX and checking the accuracy with Polygraphy? Reference: https://github.com/NVIDIA/TensorRT/tree/main/tools/Polygraphy https://github.com/NVIDIA/TensorRT/tree/main/tools/Polygraphy/examples/cli/run/01_comparing_frameworks
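For example, assuming the exported model is saved as model.onnx (a hypothetical file name), the comparison looks roughly like:

```
# Run the same ONNX model under both ONNX-Runtime and TensorRT and compare the outputs
polygraphy run model.onnx --onnxrt --trt
```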
You may refer to https://docs.nvidia.com/deeplearning/tensorrt/developer-guide/index.html#fusion-types
Or create a case and run it with trtexec --verbose; you will be able to see the final engine structure in the log, which will tell whether TRT can support your...
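A minimal sketch, assuming the model has already been exported to a hypothetical model.onnx:

```
# Build an engine and print the verbose build log, which includes the
# final (fused) engine structure and the chosen layer precisions
trtexec --onnx=model.onnx --verbose
```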
I think it should be configurable. @rajeevsrao is the author, but he is OOTO now :)
One question here: how do you compute the memory size? In the TRT verbose log there will be memory info about the engine, e.g.
```
[08/09/2022-22:08:24] [I] Engine built...
```
You can see it in the verbose log. Try searching for "Engine Layer Information".
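For instance, assuming the build log is saved to a hypothetical build.log:

```
# Capture the verbose build log, then locate the per-layer dump of the final engine
trtexec --onnx=model.onnx --verbose > build.log 2>&1
grep -n "Engine Layer Information" build.log
```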
> On the hardware, not in the verbose, aren't they the same thing

Hardware memory usage usually contains other modules like cuBLAS and cuDNN; they are not the same thing....
> Is there any way to get the data type (FP16 or FP32) of layers in a mixed-precision engine during inferencing?

No, it is only logged in the build phase.

> I have...
https://docs.nvidia.com/deeplearning/tensorrt/api/python_api/infer/Core/BuilderConfig.html#tensorrt.BuilderFlag
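A minimal sketch of setting a builder flag through the Python API (the ONNX path and the surrounding build flow are assumptions for illustration):

```python
import tensorrt as trt

ONNX_PATH = "model.onnx"  # hypothetical model path

logger = trt.Logger(trt.Logger.INFO)
builder = trt.Builder(logger)
network = builder.create_network(
    1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH)
)
parser = trt.OnnxParser(network, logger)
with open(ONNX_PATH, "rb") as f:
    parser.parse(f.read())

config = builder.create_builder_config()
config.set_flag(trt.BuilderFlag.FP16)  # allow FP16 kernels in addition to FP32
serialized_engine = builder.build_serialized_network(network, config)
```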
@nvpohanh ^ ^