ncnn
The benchmark shows int8 is slower than float. It doesn't meet our expectation
https://github.com/Tencent/ncnn/tree/master/benchmark According to the benchmark page above, int8 quantized inference is no faster than float32 on ARM and other platforms. This does not meet our expectation.
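For anyone wanting to reproduce the comparison on their own device, a minimal timing sketch follows. It toggles ncnn's `use_int8_inference` option between the fp32 and int8 variants of a model; the squeezenet file names, the 224x224x3 input shape, and the "data"/"prob" blob names are placeholders for this sketch, not part of the benchmark. The official numbers come from the benchncnn tool in the benchmark directory.

```cpp
#include "net.h" // ncnn

#include <chrono>
#include <cstdio>

// Times one model variant; use_int8 toggles ncnn's int8 inference path.
static double bench_ms(const char* param, const char* bin, bool use_int8)
{
    ncnn::Net net;
    net.opt.use_int8_inference = use_int8; // fp32 path when false
    net.load_param(param);
    net.load_model(bin);

    ncnn::Mat in(224, 224, 3); // assumed input shape, model-dependent
    in.fill(0.5f);

    const int loops = 50;
    const auto t0 = std::chrono::steady_clock::now();
    for (int i = 0; i < loops; i++)
    {
        ncnn::Extractor ex = net.create_extractor();
        ex.input("data", in); // blob names are assumptions for this sketch
        ncnn::Mat out;
        ex.extract("prob", out);
    }
    const auto t1 = std::chrono::steady_clock::now();
    return std::chrono::duration<double, std::milli>(t1 - t0).count() / loops;
}

int main()
{
    printf("fp32: %.2f ms/inference\n",
           bench_ms("squeezenet.param", "squeezenet.bin", /*use_int8=*/false));
    printf("int8: %.2f ms/inference\n",
           bench_ms("squeezenet-int8.param", "squeezenet-int8.bin", /*use_int8=*/true));
    return 0;
}
```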
Because float32 is much better optimized than int8.
The data in the benchmark readme is likely out of date. The new version improves int8 inference performance. Please update and test again. If there are any problems, feel free to reopen the issue.