TensorRT issues

EfficientNMS Plugin

2

In the EfficientNMS Plugin Introduction, "After sorting, the highest 4096 scores are processed by the EfficientNMS CUDA kernel. " Can this "4096" be configured? In my understanding , the 4096...

1661417727

triaged

Set_binding_shape returns False

1

## Description Set_binding_shape returns False ``` import os import sys sys.path.append(os.path.abspath(os.path.dirname(os.path.dirname(__file__)))) import os.path import numpy as np from multiprocessing import cpu_count import cv2 import shutil from PIL import Image from...

980202006

triaged

Can int be used as input?

1

There is int type data in my inputs for indexing other data, how can I set the int type data as input, with set_input?

980202006

triaged

Stuck and raise Error Code 2: Internal Error (Assertion memSize >= 0 failed. )

5

## Description [08/04/2022-17:24:38] [TRT] [V] *************** Autotuning Reformat: Float(E8,E8,E7,E6,(* 108 E1),108,1) where E0=(- 108 (- (# 3 (SHAPE input_1)) (* 108 (CEIL_DIV (+ (# 3 (SHAPE input_1)) -107) 108)))) E1=(CEIL_DIV...

980202006

triaged

defaultAllocator.cpp::nvinfer1::internal::DefaultAllocator::allocate::63] Error Code 1: Cuda Runtime (out of memory)

3

Change the input image size of `ONNX` model from `1 * 3 * 1024 * 1024` to `1 * 3 * 2000 * 2000`, get an error when `converting ONNX...

geoexploring

triaged

Do I have to do PTQ before QAT with pytorch_quantization toolkit?

7

In the only example provided in the toolkit, it loaded the PTQ calibrated weights and did the QAT based on it. There isn't a standalone QAT example without PTQ. I...

deephog

triaged

Build engine file failed with INT8 calibration mode

5

## Description I export onnx model and build engine with FP16 mode successfully, but the issue occurs when building engine with INT8 calibration mode. The error: ``` ... [06/30/2022-17:48:48] [TRT]...

MaxeeCR

triaged

Get wrong quantization result after setting TensorQuantizer.use_fb_fake_quant = True

1

## Description It is caused by a bug from pytorch, as shown in the issus [ "**torch.fake_quantize_per_tensor_affine will affect the results of the model if the input memory is not...

Wang-Qk

triaged

BART Error: 'BARTTRTDecoder' object has no attribute 'trt_context_non_kv'

16

## Description Follow https://github.com/NVIDIA/TensorRT/blob/main/demo/HuggingFace/notebooks/t5.ipynb to build the tensorRT example for Bart model. Get warning in `bart_trt_decoder = BARTTRTDecoder(bart_trt_decoder_engine, metadata, tfm_config)` Cannot find binding of given name: past_key_values.0.decoder.key and error in...

Luckick

triaged

Opencv compute graph, TensorRT support

1

Hey, ## Background Opencv introduce Compute graph, which every Opencv operation can be describe as graph op code. They took it further and, introduces the ability to use inference on...

oak-tree

triaged

TensorRT
TensorRT copied to clipboard

Metadata

EfficientNMS Plugin

Set_binding_shape returns False

Can int be used as input?

Stuck and raise Error Code 2: Internal Error (Assertion memSize >= 0 failed. )

defaultAllocator.cpp::nvinfer1::internal::DefaultAllocator::allocate::63] Error Code 1: Cuda Runtime (out of memory)

Do I have to do PTQ before QAT with pytorch_quantization toolkit?

Build engine file failed with INT8 calibration mode

Get wrong quantization result after setting TensorQuantizer.use_fb_fake_quant = True

BART Error: 'BARTTRTDecoder' object has no attribute 'trt_context_non_kv'

Opencv compute graph, TensorRT support

← Metadata

Owner

Metadata

TensorRT TensorRT copied to clipboard

Metadata

← Metadata

Owner

Metadata

TensorRT
TensorRT copied to clipboard