TensorRT
PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT
# Description

Don't merge, just want to show the change.

Fixes # (issue)

## Type of change

Please delete options that are not relevant and/or add your own.

- Bug...
# Description

Please include a summary of the change and which issue is fixed. Please also include relevant motivation and context. List any dependencies that are required for this change....
# Description

Convert some potential TensorRT errors (a returned boolean is false) into proper exceptions at runtime.

Fixes #2367

## Type of change

- Bug fix (non-breaking change which fixes an...
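The change above follows a common error-handling pattern. A minimal sketch in plain Python, assuming nothing about the real Torch-TensorRT internals (`TRTRuntimeError`, `raise_on_failure`, and `set_binding` are illustrative names, not the actual API):

```python
# Hypothetical sketch of the pattern this PR describes: calls that signal
# failure by returning False are wrapped so they raise a proper exception
# at runtime instead of failing silently.

class TRTRuntimeError(RuntimeError):
    """Raised when a TensorRT call reports failure via its return value."""

def raise_on_failure(ok: bool, operation: str) -> None:
    """Convert a boolean success flag into an exception."""
    if not ok:
        raise TRTRuntimeError(f"TensorRT operation failed: {operation}")

# Usage: check the returned boolean immediately after the call.
def set_binding(engine_ok: bool) -> str:
    raise_on_failure(engine_ok, "setBindingDimensions")
    return "bindings configured"

print(set_binding(True))
try:
    set_binding(False)
except TRTRuntimeError as e:
    print(e)  # prints: TensorRT operation failed: setBindingDimensions
```

The benefit over returning booleans is that the failure carries context (which operation failed) and cannot be accidentally ignored by the caller.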
## Bug Description

//tests/core/conversion/converters:test_scaled_dot_product_attention

(C++ exception with description "0 INTERNAL ASSERT FAILED at "../torch/csrc/jit/ir/alias_analysis.cpp":615, please report a bug to PyTorch. We don't have an op for aten::scaled_dot_product_attention but it isn't...
## Bug Description

Observing threshold failures. Test passes occasionally.

FAILED lowering/test_aten_lowering_passes.py::TestLowerLinear::test_lower_linear - AssertionError: 0.00113677978515625 != 0 within 4 places (0.00113677978515625 difference) : Linear TRT outputs don't match with the original...
## TL;DR

Operation converters in dynamo to support full compilation for GPT2.

## Goal(s)

Run GPT2 on multi-GPU with only 1 TensorRT engine.

## Tasks

```[tasklist]
### Tasks
- [...
```
## Bug Description

When running the compiled LSTM model in half dtype with torch-tensorrt, I get this error:

`RuntimeError: Input and parameter tensors are not the same dtype, found input...
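For context, the dtype-mismatch class of error above is easy to reproduce outside Torch-TensorRT. A minimal CPU-only sketch in plain PyTorch (using float64 vs. float32 as a stand-in for half vs. float, since half LSTMs need a GPU): an LSTM's input and parameter tensors must share a dtype, and casting both sides to one dtype resolves it.

```python
import torch

# Minimal sketch of the failure mode in this issue, reproduced with plain
# PyTorch on CPU. The LSTM's parameters default to float32; feeding it a
# float64 input triggers a dtype-mismatch RuntimeError.
lstm = torch.nn.LSTM(input_size=4, hidden_size=8)  # parameters are float32

x = torch.randn(5, 1, 4, dtype=torch.float64)      # mismatched input dtype
try:
    lstm(x)
except RuntimeError as e:
    print("mismatch rejected:", type(e).__name__)

# Fix: cast the input (or the module) so both sides share a dtype. The same
# idea applies when compiling for half: convert both model and inputs.
out, _ = lstm(x.to(torch.float32))
print(out.dtype)  # torch.float32
```

When targeting half precision, the equivalent step is converting both the module (`model.half()`) and every input tensor to `torch.half` before compilation.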
## Bug Description

TensorRT throws an error about fp32 input tensors even though I am using fp16 tensors as input. I attached the file `IFRNet.py`, adapted from [https://github.com/ltkong218/IFRNet/blob/main/models/IFRNet.py](https://github.com/ltkong218/IFRNet/blob/main/models/IFRNet.py).

## To Reproduce

Steps...
**Is your feature request related to a problem? Please describe.**

Our current workflow:

```py
ep = torch.export.export(model, (inputs,))
trt_gm = torch_tensorrt.dynamo.compile(ep, inputs=[inputs])
torch_tensorrt.save(trt_gm, "trt.ep", inputs=[inputs])
```

Desired workflow:

```py
ep...
```
## Bug Description

## To Reproduce

Steps to reproduce the behavior:

```py
input_data = torch.rand([1, 3, 1280, 720]).cuda(device)
print(type(input_data))
# input_data = input_data.to(device)
# Trace the module with example data
traced_model...
```