Michal Guzek
@adaber , I don't see Windows in https://nvidia.github.io/TensorRT-Model-Optimizer/getting_started/2_installation.html, so I will let @riyadshairi979 confirm
@ckolluru , in theory, INT8 inference should generally outperform FP16, because lower-precision arithmetic uses less compute and memory bandwidth. However, I believe for...
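As a rough illustration of the bandwidth side of that argument (a minimal NumPy sketch, not TensorRT itself; the tensor shape is just an example), an INT8 tensor occupies half the bytes of an FP16 tensor of the same shape, so each activation or weight tensor costs half as much memory traffic:

```python
import numpy as np

# Same logical tensor in two precisions; the byte footprint is what
# drives memory-bandwidth cost during inference.
shape = (1, 3, 384, 640)
fp16_tensor = np.zeros(shape, dtype=np.float16)
int8_tensor = np.zeros(shape, dtype=np.int8)

print(fp16_tensor.nbytes)  # 1474560
print(int8_tensor.nbytes)  # 737280 -- half the traffic per tensor
```

Actual speedups depend on more than footprint (kernel availability, quantize/dequantize overhead, which layers fall back to higher precision), which is why INT8 is not automatically faster end to end.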
@maisa32 sorry for the late reply. I've just checked with the PM team:
- Long-term solution: DLA3 will support both EQ and IQ
- Short/medium-term solution: -...
CC @akhilg-nv or @asfiyab-nvidia who are more up to speed with demos
@ChuRuaNh0 , I will be closing this ticket per our policy of closing tickets that have had no activity for more than 21 days after a reply was posted. Please...
@kimdwkimdw can you send the model to [email protected]? I can file an internal bug for this
@kylechang523 , I will be closing this ticket per our policy of closing tickets that have had no activity for more than 21 days after a reply was posted. Please...
Have you serialized the NumPy arrays properly using a JSON-compatible format, e.g. like:
```
import json
import numpy as np

input_data = np.random.rand(1, 3, 384, 640).astype(np.float32)
input_data_list = input_data.tolist()  # nested Python lists are JSON-serializable
with open("custom.json", "w") as f:
    json.dump({"input": input_data_list}, f)
```
...
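A quick way to confirm the serialization worked is to round-trip the data: write it out, read the JSON back, and compare against the original array (a minimal sketch; the "custom.json" filename and "input" key follow the snippet above):

```python
import json

import numpy as np

# Write the array out as JSON-compatible nested lists.
input_data = np.random.rand(1, 3, 384, 640).astype(np.float32)
with open("custom.json", "w") as f:
    json.dump({"input": input_data.tolist()}, f)

# Read it back and rebuild the array with the original dtype.
with open("custom.json") as f:
    restored = np.asarray(json.load(f)["input"], dtype=np.float32)

print(restored.shape)                         # (1, 3, 384, 640)
print(np.array_equal(input_data, restored))   # True
```

If the round-trip does not compare equal, the loader is likely not parsing the file into the shape and dtype the engine expects.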
@yikox , I will be closing this ticket per our policy of closing tickets that have had no activity for more than 21 days after a reply was posted. Please...
According to the issue, the problem seems to be with a node that was offloaded to one of our backend DL graph compilers, so we can investigate it internally. Can...