TensorRT issues

Revised the lowering pass according to Bo's suggestion

# Description Please include a summary of the change and which issue is fixed. Please also include relevant motivation and context. List any dependencies that are required for this change....

cehongwang

cla signed

✨[Feature] ONNX vs. ATen Perf Testing.

1

**Is your feature request related to a problem? Please describe.** Compare ONNX vs ATen subgraph perf **Describe the solution you'd like** **Describe alternatives you've considered** **Additional context**

narendasan

feature request

🐛 [Bug] Model Tests Broken If Running In Parallel

## Bug Description This fixed temporary [file path](https://github.com/pytorch/TensorRT/blob/ca0765c444af2ddd6cc4c6d0361887f95f2208d9/tests/py/dynamo/models/test_export_serde.py#L20) used by all tests will cause racing condition if tests are running in parallel. ## Expected behavior Tests should be fine if...

leimao

bug

component: tests

✨[Feature] Pervent Custom Ops from Registering Plugins multiple times

**Is your feature request related to a problem? Please describe.** **Describe the solution you'd like** If I have multiple custom ops, I can create a plugin for each, but each...

narendasan

feature request

chore: Update lock file, was getting stuck and causing build issues f…

2

…or folks locally # Description The lock file had a stale version of torch in it and some symbols got shifted around. Caused the C++ build to use latest but...

narendasan

component: build system

cla signed

🐛 [Bug] Dynamic Shape Type Mismatch Error When Using Static Shape

4

## Bug Description ## To Reproduce Steps to reproduce the behavior: 1. Install the torch_tensorrt wheels found at https://pypi.jetson-ai-lab.io/jp6/cu126 (2.8 for cu126) on a Jetson Orin Nano running Jetpack 6.2...

henrymmorton

bug

Llama distributed example

The PR addresses 1. Llama3 end to end example with complex graph lowering 2. Removal of hardcoded components of rotary embedding example

apbose

component: lowering

component: api [Python]

component: runtime

cla signed

component: dynamo

component: torch_compile

Changes to TRT-LLM download tool for multigpu distributed case

TRT-LLM installation tool for distributed 1. The download is to be done by only one GPU to avoid unnecessary downloads 2. Use of lock files in the tool for the...

apbose

component: tests

component: api [Python]

cla signed

✨[Feature] Support automatically generating AOT QDP Plugins

**Is your feature request related to a problem? Please describe.** We support generating JIT QDP plugins automatically from PyTorch custom op registrations, but the performance makes them pretty unusable. Therefore...

narendasan

feature request

📖 [Story] Support using Custom Kernels in Torch-TensorRT

## TL;DR ## Goal(s) ## Tasks ```[tasklist] ### Tasks ``` ## Additional context

narendasan

Story

TensorRT
TensorRT copied to clipboard

Metadata

Revised the lowering pass according to Bo's suggestion

✨[Feature] ONNX vs. ATen Perf Testing.

🐛 [Bug] Model Tests Broken If Running In Parallel

✨[Feature] Pervent Custom Ops from Registering Plugins multiple times

chore: Update lock file, was getting stuck and causing build issues f…

🐛 [Bug] Dynamic Shape Type Mismatch Error When Using Static Shape

Llama distributed example

Changes to TRT-LLM download tool for multigpu distributed case

✨[Feature] Support automatically generating AOT QDP Plugins

📖 [Story] Support using Custom Kernels in Torch-TensorRT

← Metadata

Owner

Metadata

TensorRT TensorRT copied to clipboard

Metadata

← Metadata

Owner

Metadata

TensorRT
TensorRT copied to clipboard