Po-Wei (Vincent) comments

Results 70 comments of


                                            Po-Wei (Vincent)

No such file 'libtensorrt_llm.so' while building wheel

@Shixiaowei02 any updates on this?

ideas

As said, TensorRT is mainly for inference optimizations. For training related issues please refer to each frameworks' approach. Thanks!

PTQ support for ViT models

Might have mis-read. Thanks for the response. Let me check and get back to you.

Also, `pytorch_quantization` will not receive further development as stated [here](https://github.com/NVIDIA/TensorRT/tree/release/10.8/tools/pytorch-quantization). TensorRT-Model-Optimizer is now the encouraged path.

How obtain the classification label of BERT model?

@symphonylyh any updates on this?

fails to parse valid onnx model: API Usage Error (node_of_reduce_min_output: at least 1 dimensions are required for input.)

Seems like the `ReduceMin` is getting a scalar instead of a 1D tensor. Can you check if setting `keepdims=1` for the `ReduceSumSquare` OP works? See [ONNX spec for ReduceSumSquare](https://github.com/onnx/onnx/blob/main/docs/Operators.md#attributes-86) Another...

Po-Wei (Vincent)

No such file 'libtensorrt_llm.so' while building wheel

ideas

PTQ support for ViT models

PTQ support for ViT models

How obtain the classification label of BERT model?

fails to parse valid onnx model: API Usage Error (node_of_reduce_min_output: at least 1 dimensions are required for input.)

对Yolov8n.pt转wts后，再序列化构建时，addResize()函数返回值为空，应如何解决？

detectron2 faster rcnn to tensor rt

[DRAFT] Introducing multi-vocab token sampling for audio generation

chore: better quantization calibration loop for modelopt