inference icon indicating copy to clipboard operation
inference copied to clipboard

BERT model int8 comparison

Open lixiaolx opened this issue 3 years ago • 1 comments

Can onnx(from tf) and torch(from hugging) match the corresponding model under the int8 model? The operators of the last few layers of the onnx model in the current int8 mode and the torch model are completely inconsistent with the final model output? Inconvenient to use and performance analysis comparison

lixiaolx avatar Oct 20 '22 06:10 lixiaolx