TensorRT issues

Why can't we set the precision of all layers to fp16 or fp32?

7

## Description Hello, I'm trying to set the precision of specific layers to fp32, but after setting some layers, I don't see any improvement (the final output is still NaN)....

sanbuphy

triaged

[no instance of function template] when installing TensorRT 8.6.1.6

2

## Description I am installing TensorRT by following the Readme.md instructions. However, running make fails with the error below: `error: no instance of function template "cuda::std::__4::plus::operator()" matches...`

JisuHann

triaged

Using tensorflow quantisation toolkit, how to export to tflite correctly?

1

I am trying to do QAT training with a simple MobileNetV3-like model. Training goes well, but when I save the model to Keras and then convert it...

batrlatom

triaged

TensorRT 8.5 deprecated functions

2

I updated my TensorRT version to 8.5 and got warnings about deprecated functions, opened question on NVIDIA developer forums [tensorrt-8-5-depecated-functions](https://forums.developer.nvidia.com/t/tensorrt-8-5-depecated-functions/278907). I used [void nvinfer1::IRuntime::destroy()](https://docs.nvidia.com/deeplearning/tensorrt/archives/tensorrt-801/api/c_api/classnvinfer1_1_1_i_runtime.html#ab07e802f58331e0c7e215e58422bd936) and it `Deprecated interface will be...

lioriz1

triaged

The same engine produces incorrect inference output using C++, but correct results using Python. TensorRT-8.6.1.6+cuda12.1

8

TensorRT-8.6.1.6+cuda12.1 GPU RTX3090 24G

154775258

triaged

Custom Attention implementation not well optimised by TensorRT

12

## Description Hi, I am working on a model that employs a modified attention mechanism, incorporating pooling on top of K and V to reduce computational load. However, I'm encountering...

david-PHR

triaged

Could not find any implementation for node /network.0/network.0.0/norm1/LayerNormalization.

5

## Description I customized TensorRT's Col2Im plugin, recompiled the source code of TensorRT8.5, and generated a new nvinfer_plugin library. This is the LayerNormalization node information in the model, ![image](https://github.com/NVIDIA/TensorRT/assets/19351259/ef132e1d-23ef-4ff0-8ec5-33fc7ee9aa4b) So...

demuxin

triaged

TensorRT fp32 inference slower than PyTorch on Tesla T4 for GroundingDINO

3

## Description I converted GroundingDINO from torch to TensorRT on an A100, which speeds up inference by 50%. However, when I deploy the same model on a T4, after I rebuild the engine,...

shuchang0714

triaged

Conditions / example of DepSepConvolution fusion

16

https://docs.nvidia.com/deeplearning/tensorrt/developer-guide/index.html#fusion-types says: **Depthwise Separable Convolution** `A depthwise convolution with activation followed by a convolution with activation `**may sometimes**` be fused into a single optimized DepSepConvolution layer. The precision of both...

vadimkantorov

triaged

Questions about GroundingDINO TensorRT acceleration

9

We have exported onnx through the script provided by the enthusiastic experts. The onnx file takes about 5 seconds to infer an image using CPU, and 2 seconds to infer...

xiyangyang99

triaged
