penglu
penglu
## Description When I use the mixed precision(fp16+fp32) engine, I find that the memory size of the mixed precision engine is very close to the fp32 engine. Here are ops...
## Description I can get the profile by trtexce: `./trtexec --loadEngine=debug_fp16.trt --dumpProfile --shapes=input:1x3x512x512 --exportProfile=debug_profile ` How can I get the debug_profile by python when I convert onnx to trt engine...
## Description When I run `polygraphy run --onnxrt ./my_onnx.onnx --save-inputs onnx_inputs.json` to save the inputs data of my onnx model, I am very puzzled by this onnx_inputs.json The content of...
## Description When I use the TensorRT C++ API and Layernorm plugin to build my network, it's segmentation fault! I debug the Layernorm.cpp, it's wrong in this code: https://github.com/NVIDIA/TensorRT/blob/87f3394404ff9f9ec92c906cd4c39b5562aea42e/plugin/skipLayerNormPlugin/skipLayerNormPlugin.cpp#L430 and...
## Description I want to use TensorRT API to rebuild a model, which contains some conv1d layers [`use torch.Conv1d()`]. Inputs shape is [1,1,82000], and the output of conv1d is [1,512,16399],...
## Description When I use TensorRT API and GeluPlugin to build my network, there has a bug: corrupted double-linked list. ## Environment **TensorRT Version**: 8.4.0.6 **CUDA Version**: 10.2 **CUDNN Version**:...
## Description I use onnx-graphsurgeon to merge gelu ops, and rename Node in graph is "CustomGeluPlugin". I have also replaced plugin name from "CustomGeluPluginDynamic" to "CustomGeluPlugin". But when I use...
## Description When I use the GELU Plugin in my project, it increases the inference time. **Before** GELU Plugin, inference time is **44ms** (fp32). **After** GELU Plugin, inference time is...
### Description ```shell Branch:main GPU: Tesla T4 CUDA: 10.2 TensorRT: 8.4.0.6 Docker: No use ``` ### Reproduced Steps ```shell When I build the BERT(C++) project, there has a error. 1....
I am new to cuml, and when i run ./build.sh,errors are below: **Describe the bug** > CMake Error at CMakeLists.txt:21 (include):include could not find requested file: > rapids-cmake > CMake...