TensorRT
PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT
# Description A converter for the torch.ops.aten._local_scalar_dense operation, a low-level operation in PyTorch used internally to extract a scalar value from a tensor containing a single element. To reviewer, I...
## Bug Description All previous lines work as expected, but cell number 28 in this example notebook: https://github.com/pytorch/TensorRT/blob/main/notebooks/qat-ptq-workflow.ipynb yields this error: ``` W0425 19:29:52.724189 140194752341824 _compile.py:108] Input graph is a...
## Bug Description Using main branch, BERT torchscript test fails FAILED models/test_models.py::TestModels::test_bert_base_uncased - RuntimeError: Schema not found for node. File a bug report. Node: %6932 : int = aten::add(%5840, %6931,...
## ❓ Question I am attempting to execute [PTQ](https://pytorch.org/TensorRT/user_guide/ptq.html). During the compiling process, I get the following exception: ``` DEBUG: [Torch-TensorRT TorchScript Conversion Context] - Finalize: %142 : Tensor =...
Dependency of PR #2519
## Bug Description Hi All, How are you? I'm trying to save a compiled module but am encountering errors. After I compiled my model I noticed that its type is `OptimizedModule`. Following...
# Description Fix a bug when the module outputs are non-tensors (example: torch.ops.aten.sym_size will return integers) ## Type of change Please delete options that are not relevant and/or add your...
# Description Added a pre-AOT lowering pass to remove `detach` ops Fixes #2657 ## Type of change - New feature (non-breaking change which adds functionality) # Checklist: - [x] My...
## ❓ Question When using a simple model with int8 quantization (pytorch-quantization) whose output layer is a deconvolution, the TorchScript to Torch-TensorRT conversion fails with the wrong number of output channels. If...
# Description This PR adds FP8 and BF16 datatype support. It also implements a converter for FP8 quantized ops. ## Type of change Please delete options that are not relevant and/or...