TensorRT
PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT
# Description

As I requested, TensorRT 10.14 added a serialization flag, `trt.SerializationFlag.INCLUDE_REFIT`, that allows refitted engines to remain refittable; that means engines can be refitted multiple times. Based on that capability,...
# Description

Here is the CI pipeline: https://github.com/pytorch/TensorRT/actions/runs/20152165935/job/57847168432

Here is the auto-commit record: https://github.com/pytorch/TensorRT/commit/ab76c1db4d6c91c56956308a1db2f7ce37fa7fad

PyTorch upgraded CUDA from [13.0.0 to 13.0.2](https://github.com/pytorch/pytorch/commit/544b443ea1d1a9b19e65f981168a01cb87a2d333), which upgraded nvidia-cuda-runtime==13.0.96; however, tensorrt_cu13 has...
## Bug Description

```py
from tensorrt import Logger, Runtime
from torch import randn
from torchvision.models import mobilenet_v2, MobileNet_V2_Weights
from torch_tensorrt import convert_method_to_trt_engine

# Create model
weights = MobileNet_V2_Weights.DEFAULT
model =...
```
# Description Please include a summary of the change and which issue is fixed. Please also include relevant motivation and context. List any dependencies that are required for this change....
# Description

Adds a test case to cover #3775.

## Type of change

Please delete options that are not relevant and/or add your own.

- Bug fix (non-breaking change which fixes...
**Is your feature request related to a problem? Please describe.**

**Describe the solution you'd like**

An implementation of the engine cache that is concurrency-aware. It should spin-lock repeated requests...
Reduction targets need to be added to the Autocast `DepthOfReductionRule`.
## Bug Description

```
from contextlib import nullcontext

import torch
import torch.nn as nn
import torch.nn.functional as F
import torch_tensorrt


class SampleNetwork(nn.Module):
    def __init__(
        self,
        num_attention_heads: int,
    ) -> None:...
```
This PR:

1. Adds rank-based logging for the distributed examples
2. Corrects the fallback-to-PyTorch case for the NCCL converters
3. Together with #3830, provides utilities for running distributed...
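Rank-based logging along the lines of item 1 might look like the sketch below. The `get_rank_logger` helper and its reliance on the `RANK` environment variable (set by `torchrun`) are assumptions for illustration, not the PR's actual implementation.

```python
import logging
import os


def get_rank_logger(name: str = "distributed_example") -> logging.Logger:
    # Hypothetical helper: read the rank from the RANK env var set by
    # torchrun, defaulting to 0 for single-process runs.
    rank = int(os.environ.get("RANK", "0"))
    logger = logging.getLogger(f"{name}.rank{rank}")
    if not logger.handlers:
        handler = logging.StreamHandler()
        # Prefix every record with the rank so interleaved multi-process
        # output remains attributable.
        handler.setFormatter(
            logging.Formatter(f"[rank {rank}] %(levelname)s: %(message)s")
        )
        logger.addHandler(handler)
        logger.setLevel(logging.INFO)
    return logger
```

Embedding the rank in both the logger name and the message prefix keeps per-rank loggers distinct while making combined logs readable.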