inference
BERT model int8 comparison
When both are quantized to int8, can an ONNX model (converted from TensorFlow) and a PyTorch model (from Hugging Face) be made to match for the same BERT architecture? In int8 mode, the operators in the last few layers of the ONNX model are completely different from those of the PyTorch model, and the final model outputs do not match. This makes side-by-side use and performance comparison inconvenient.
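One way to quantify how far apart the two int8 models are is to dump the final logits from each runtime as NumPy arrays and compare them numerically, using a loose tolerance since int8 quantization adds noise. A minimal sketch with synthetic arrays standing in for the two dumped outputs (the `compare_logits` helper and the tolerance value are illustrative assumptions, not part of either toolchain):

```python
import numpy as np

def compare_logits(onnx_out, torch_out, atol=1e-1):
    # int8 quantization introduces noise, so compare with a loose tolerance
    diff = np.abs(onnx_out - torch_out)
    return {
        "max_abs_diff": float(diff.max()),
        "mean_abs_diff": float(diff.mean()),
        "allclose": bool(np.allclose(onnx_out, torch_out, atol=atol)),
    }

# Synthetic stand-ins: torch output plus small quantization-like noise.
# In practice these would come from session.run(...) and model(...).logits.
rng = np.random.default_rng(0)
torch_out = rng.standard_normal((1, 128, 768)).astype(np.float32)
onnx_out = torch_out + rng.normal(scale=0.01, size=torch_out.shape).astype(np.float32)

report = compare_logits(onnx_out, torch_out)
print(report)
```

If the max absolute difference is large only in the last few layers' outputs, comparing intermediate activations layer by layer (rather than just the final logits) can localize where the two int8 graphs diverge.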