TensorRT icon indicating copy to clipboard operation
TensorRT copied to clipboard

🐛 [Bug] pdist got accuracy issue in dyanmic shape on tensorrt 10.14.1.48 and tensorrt_rtx 1.2

Open lanluo-nvidia opened this issue 2 months ago • 0 comments

Bug Description

FAILED conversion/test_pdist_aten.py::TestDynamicShapePdistConverter::test_pdist_float_4_dim0_dynamic_dim1_dynamic_p_other - AssertionError: Tensor-likes are not close!

Mismatched elements: 4 / 6 (66.7%) Greatest absolute difference: 0.291101336479187 at index (5,) (up to 0.005 allowed) Greatest relative difference: 0.23916198313236237 at index (5,) (up to 0.005 allowed)

To execute this test, run the following from the base repo dir: python test_pdist_aten.py TestDynamicShapePdistConverter.test_pdist_float_4_dim0_dynamic_dim1_dynamic_p_other

This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 ==== 1 failed, 1857 passed, 50 skipped, 9757 warnings in 102.78s (0:01:42) =====

https://github.com/pytorch/TensorRT/actions/runs/19379017156/job/55454605573

It is failed for both tensorrt 10.14 and tensorrt_rtx 1.2, it is only failing when all the dimensions are dynamic shape. It was successful in tensorrt 10.13 and tensrort_rtx 1.0.

Steps to reproduce the behavior:

Expected behavior

Environment

Build information about Torch-TensorRT can be found by turning on debug messages

  • Torch-TensorRT Version (e.g. 1.0.0):
  • PyTorch Version (e.g. 1.0):
  • CPU Architecture:
  • OS (e.g., Linux):
  • How you installed PyTorch (conda, pip, libtorch, source):
  • Build command you used (if compiling from source):
  • Are you using local sources or building from archives:
  • Python version:
  • CUDA version:
  • GPU models and configuration:
  • Any other relevant information:

Additional context

lanluo-nvidia avatar Nov 15 '25 01:11 lanluo-nvidia