TensorRT issues

🐛 [Bug] TensorRT-RTX BatchNorm constant fold got nan

## Bug Description Got nan result if batchnorm constant fold is enabled in RTX if disable the batchnorm constant fold in RTX, it is working as expected. ## To Reproduce...

lanluo-nvidia

bug

🐛 [Bug] Llama2_flashinfer_rmsnorm example is broken

Logs: ``` File "/usr/local/lib/python3.12/dist-packages/torch/_ops.py", line 524, in __call__ return wrapper() ^^^^^^^^^ File "/usr/local/lib/python3.12/dist-packages/torch/_ops.py", line 515, in wrapper return torch.overrides.handle_torch_function( ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/usr/local/lib/python3.12/dist-packages/torch/overrides.py", line 1725, in handle_torch_function result = mode.__torch_function__(public_api, types,...

cehongwang

bug

cherry pick 3700 to 2.8 release: Broadcast removal

# Description Please include a summary of the change and which issue is fixed. Please also include relevant motivation and context. List any dependencies that are required for this change....

cehongwang

documentation

component: tests

component: lowering

component: conversion

component: converters

component: build system

component: api [Python]

component: api [C++]

component: runtime

cla signed

component: fx

fx

component: dynamo

component: torch_compile

📖 [Story] Converters optimization

## TL;DR Based on the PR #3732, converters need to be optimized to meet the requirement of converter unit tests that Torch-TRT is expected to have the same num of...

zewenli98

Story

🐛 [Bug] Torch-TRT has more Reformatting than ONNX-TRT

## Bug Description Torch-TRT has more Reformatting than ONNX-TRT, which affects perf. For example: ``` , { "name" : "Reformatting CopyNode for Input Tensor 1 to [CONVOLUTION]-[aten_ops.convolution.default]-[model.0.residual/convolution] + [ELEMENTWISE]-[aten_ops.add.Tensor]-[model.0/add]", "timeMs"...

zewenli98

bug

TensorRT
TensorRT copied to clipboard

Metadata

🐛 [Bug] TensorRT-RTX BatchNorm constant fold got nan

🐛 [Bug] Llama2_flashinfer_rmsnorm example is broken

cherry pick 3700 to 2.8 release: Broadcast removal

📖 [Story] Converters optimization

🐛 [Bug] Torch-TRT has more Reformatting than ONNX-TRT

← Metadata

Owner

Metadata

TensorRT TensorRT copied to clipboard

Metadata

🐛 [Bug] TensorRT-RTX BatchNorm constant fold got nan

🐛 [Bug] Llama2_flashinfer_rmsnorm example is broken

cherry pick 3700 to 2.8 release: Broadcast removal

📖 [Story] Converters optimization

🐛 [Bug] Torch-TRT has more Reformatting than ONNX-TRT

← Metadata

Owner

Metadata

TensorRT
TensorRT copied to clipboard