Lenan22 comments

Results: 31 comments of Lenan22

Xavier yolo to Int8 met error

> Hi, I hit the error below while converting an ONNX model to an INT8 TRT engine on Xavier.
>
> I just followed the steps in the instructions and executed: python3...

> I checked the code at line 196 of ppq.parser.tensorRT.py: input_shapes is already set to [32,3,640,640]. After the for loop at line 247 I added the code below, which sets the profile's min, opt, and max shapes all to [32,3,640,640].
>
> `min, opt, max = input_shapes[inp.name]`
> `profile.set_shape_input(inp.name, min, opt, max)`
>
> ```
> config.add_optimization_profile(profile)
> trt_engine = builder.build_engine(network, config)
> ```
>
> But the problem above still occurs.
>
> Later I modified the code in 04_Benchmark.py, setting batch_size to 1 and the input shape to [32,3,640,640]. The code also runs, and it is about 10x faster than fp32 (on a 3090 GPU)...
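
For reference, a minimal sketch of registering a fixed-shape optimization profile, along the lines the quoted comment describes. The helper name `add_fixed_shape_profile` and its arguments are hypothetical and are not taken from ppq. It assumes `builder`, `network`, and `config` are the usual TensorRT builder, network definition, and builder config objects. In the TensorRT Python API, `set_shape` sets the dimension range of an ordinary input tensor, while `set_shape_input` sets the value range of a shape-tensor input. Whether that distinction is related to the error in this thread is not established here.

```python
def add_fixed_shape_profile(builder, network, config, input_shapes):
    """Register one optimization profile whose min, opt and max shapes are identical.

    Hypothetical helper: `input_shapes` maps each network input name to a
    shape such as [32, 3, 640, 640].
    """
    profile = builder.create_optimization_profile()
    for i in range(network.num_inputs):
        inp = network.get_input(i)
        shape = tuple(input_shapes[inp.name])
        # set_shape(name, min, opt, max) applies to ordinary (execution)
        # tensors; all three are the same here, so the shape is fixed.
        profile.set_shape(inp.name, shape, shape, shape)
    # The profile only takes effect once it is attached to the builder config.
    config.add_optimization_profile(profile)
    return profile
```

With a fixed profile like this the engine accepts exactly one input shape, so the benchmark script would need to feed tensors of that same shape.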

Is the conv quantization error too large?

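> I have a model that shows a fairly large error during quantization analysis. How can I optimize this? I chose TargetPlatform.PPL_CUDA_INT8 with the default settings. ![image](https://user-images.githubusercontent.com/30458099/193197735-54f4a20a-b77c-4180-8212-fbc0e2343490.png)

If an optimization algorithm is used, it is normal for some intermediate layer of the network to show a large error (label-free training sometimes causes this). What matters is the simulated error at the last layer and how the model performs on hardware.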
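
To make "simulated error" concrete, here is a minimal sketch of how such an error could be measured for a single tensor. It assumes symmetric per-tensor INT8 quantization and a noise-to-signal energy ratio as the metric. Both are illustrative assumptions, not necessarily what ppq's analysis tools compute, and the function names are hypothetical.

```python
def quantize_int8(values):
    """Fake-quantize a list of floats: map to INT8 levels, then back to float.

    Assumes symmetric per-tensor quantization with levels in [-127, 127].
    """
    max_abs = max(abs(v) for v in values)
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    return [max(-127, min(127, round(v / scale))) * scale for v in values]


def noise_to_signal(reference, quantized):
    """Energy of the quantization noise divided by the energy of the reference."""
    signal = sum(r * r for r in reference)
    noise = sum((r - q) ** 2 for r, q in zip(reference, quantized))
    return noise / signal if signal > 0 else 0.0


# Compare a float tensor with its fake-quantized version.
layer_output = [0.5, -1.0, 0.25]
error = noise_to_signal(layer_output, quantize_int8(layer_output))
print(f"simulated quantization error: {error:.2e}")
```

Applying the same measurement to the network's final output, rather than to one intermediate layer, gives the number the reply above suggests looking at.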

int8 TensorRT

Please refer to our open-source quantization tool, ppq. Its quantization results are better than those of the quantization tool that ships with TensorRT, and are almost the same as the float32 model. https://github.com/openppl-public/ppq/blob/master/md_doc/deploy_trt_by_OnnxParser.md

Lenan22

Xavier yolo to Int8 met error

How to set the engine max batch size?

Is the conv quantization error too large?

int8 TensorRT

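INT8 conversion problem: conversion succeeds, but no objects are detected

INT8 conversion problem: conversion succeeds, but no objects are detected

INT8 conversion problem: conversion succeeds, but no objects are detected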

Yolov5s' INT8 model gave very poor result.

int8 engine generation failed

INT8 prediction results are incorrect