TensorRT
PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT
- Get CI up and running for the Windows implementation using only the Python runtime
Hi Torch-TensorRT team: how can I compile a model with 8-bit weights but 16-bit activations? Thanks a lot!
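Not an authoritative answer, but the usual starting point in Torch-TensorRT is the `enabled_precisions` argument of `torch_tensorrt.compile`: listing both `torch.int8` and `torch.half` lets the TensorRT builder mix precisions per layer, although pinning weights to INT8 specifically generally requires calibration data or an explicitly quantized (Q/DQ) model. A minimal sketch with a placeholder model:

```python
import torch
import torch_tensorrt

# Placeholder model and input shape, purely for illustration.
model = torch.nn.Sequential(
    torch.nn.Linear(128, 256),
    torch.nn.ReLU(),
    torch.nn.Linear(256, 64),
).half().eval().cuda()

example_inputs = [torch_tensorrt.Input((8, 128), dtype=torch.half)]

# Allowing both INT8 and FP16 lets the builder choose per-layer kernels;
# INT8 layers are only selected when calibration data or Q/DQ nodes are present.
trt_model = torch_tensorrt.compile(
    model,
    inputs=example_inputs,
    enabled_precisions={torch.int8, torch.half},
)
```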
## Bug Description
RuntimeError: [Error thrown at core/partitioning/shape_analysis.cpp:183] Expected ivalues_maps.count(input) to be true but got false Could not find torch::jit::Value* hidden.1 produced from %hidden.1 : (Tensor, Tensor) = prim::TupleConstruct(%440, %440)...
## TL;DR
Prioritize coverage for the [core ATen opset](https://pytorch.org/docs/main/torch.compiler_ir.html#core-aten-ir).

## Goal(s)
- Determine a priority order, based on criteria such as key model requirements, for which operators from the core ATen opset...
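One way to ground the priority order is to measure which core ATen operators key models actually emit. The sketch below is illustrative (the model is a stand-in, not one of the models referenced above): it exports a module, lowers it to the core ATen IR, and counts the operators that appear.

```python
from collections import Counter

import torch
from torch.export import export

# Stand-in model; in practice this would be one of the key models.
model = torch.nn.Sequential(torch.nn.Conv2d(3, 8, 3), torch.nn.ReLU()).eval()
example_args = (torch.randn(1, 3, 32, 32),)

# run_decompositions() lowers the exported graph to the core ATen opset.
ep = export(model, example_args).run_decompositions()

op_counts = Counter(
    str(node.target) for node in ep.graph.nodes if node.op == "call_function"
)
for op, count in op_counts.most_common():
    print(f"{op}: {count}")
```

Aggregating such counts across a set of key models would give one concrete input to the ranking.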
# Description
Please include a summary of the change and which issue is fixed. Please also include relevant motivation and context. List any dependencies that are required for this change....
# Description
- Update `setup.py` to enable `bdist_wheel` build
- Add GHA tooling for Windows

Fixes #2489

## Type of change
- New CI Path

# Checklist:
- [ ]...
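For context, a hypothetical, heavily trimmed `setup.py` sketch of the kind of `setuptools` configuration that supports a `bdist_wheel` build is shown below; the project's real file additionally handles extension modules, versioning, and platform specifics that are omitted here.

```python
# Hypothetical minimal setup.py; name and metadata are placeholders.
from setuptools import find_packages, setup

setup(
    name="example_package",
    version="0.0.1",
    packages=find_packages(),
    python_requires=">=3.8",
)
```

With `setuptools` and `wheel` installed, `python setup.py bdist_wheel` (or `python -m build --wheel`) then produces a wheel under `dist/`.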
- Explicit considerations of truncation of inputs versus truncation of constants

Sourced from https://github.com/pytorch/TensorRT/pull/2457#issuecomment-1889824984:

- Avoid running PyTorch graphs with invalid casts
- Refactor `repair_long_and_double` to consume output of type...
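To make the input-versus-constant distinction concrete, here is a hypothetical illustration (not the actual `repair_long_and_double` pass): truncating inputs casts tensors at the graph boundary at call time, while truncating constants rewrites the wide-typed buffers baked into the graph module itself.

```python
import torch
from torch import fx

# dtypes that TensorRT engines have historically not accepted directly
CASTS = {torch.int64: torch.int32, torch.float64: torch.float32}

def truncate_runtime_inputs(*tensors: torch.Tensor):
    """Hypothetical: downcast wide-typed *inputs* at call time."""
    return tuple(t.to(CASTS[t.dtype]) if t.dtype in CASTS else t for t in tensors)

def truncate_graph_constants(gm: fx.GraphModule) -> fx.GraphModule:
    """Hypothetical: downcast wide-typed *constants* (buffers baked into the graph)."""
    for name, buf in gm.named_buffers():
        if buf.dtype in CASTS:
            module_path, _, leaf = name.rpartition(".")
            owner = gm.get_submodule(module_path) if module_path else gm
            owner.register_buffer(leaf, buf.to(CASTS[buf.dtype]))
    gm.recompile()
    return gm

# Tiny usage example: an FX-traced module carrying an int64 constant.
class WithLongConstant(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.register_buffer("offsets", torch.arange(4, dtype=torch.int64))

    def forward(self, x):
        return x + self.offsets

gm = fx.symbolic_trace(WithLongConstant())
gm = truncate_graph_constants(gm)
print(gm.offsets.dtype)  # torch.int32
```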
## ❓ Question
I am within the nvcr.io/nvidia/pytorch:23.12-py3 container. The performance of torch_tensorrt is worse than inductor. Details: example code

```python
import torch
import torch_tensorrt
import torch.nn as nn

class...
```
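For comparisons like this, both backends need warmup iterations and CUDA-event timing before the numbers mean much. Below is a hedged benchmarking sketch with a placeholder model (not the original example); ir="dynamo" is one of several compilation paths.

```python
import torch
import torch_tensorrt

def benchmark(fn, x, iters=100, warmup=20):
    """Time a callable with CUDA events after a warmup phase (ms per iteration)."""
    for _ in range(warmup):
        fn(x)
    torch.cuda.synchronize()
    start = torch.cuda.Event(enable_timing=True)
    end = torch.cuda.Event(enable_timing=True)
    start.record()
    for _ in range(iters):
        fn(x)
    end.record()
    torch.cuda.synchronize()
    return start.elapsed_time(end) / iters

# Placeholder model and input, standing in for the original example.
model = torch.nn.Sequential(torch.nn.Linear(1024, 1024), torch.nn.ReLU()).cuda().eval()
x = torch.randn(64, 1024, device="cuda")

with torch.no_grad():
    inductor_model = torch.compile(model, backend="inductor")
    trt_model = torch_tensorrt.compile(
        model, ir="dynamo", inputs=[x], enabled_precisions={torch.float32}
    )
    print("inductor:", benchmark(inductor_model, x), "ms/iter")
    print("tensorrt:", benchmark(trt_model, x), "ms/iter")
```

The absolute numbers depend heavily on the model, batch size, and precision, so this is only a measurement harness, not a verdict on either backend.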