Nick Comly
Nick Comly
CC: @narendasan @andi4191 Do we have test cases in DLFW for FX?
Hi @lixiaolx is the model trained using the [PyTorch QAT toolkit](https://github.com/NVIDIA/TensorRT/tree/main/tools/pytorch-quantization)?
@borisfom is this issue still present with the latest APIs? Do you have a preference for CLI vs API?
Between these two versions there was a constant time operation that was added to check compatibility of the current device with the compiled model. This is likely the overhead you...
@narendasan seems a lowering pass could be a good WAR here.
@peri044 bump. @narendasan we should bring up with TRT team in our sync Friday
The outstanding layers have been added as feature requests. Adding the bug tag since `require_full_compilation = False` so partial compilation should work.
@peri044 for another PTQ bug. P1
This is a constraint on data dependent shapes (DDS), currently slated for v1.4 end of year.
Hi @wzywzywzy this is because of TensorRT's kernel autotuning. TRT selects the fastest kernels for your model, regardless of sparsity, so in this case the dense kernels may be faster...