The performance of ONNX fp16 seems to be even worse than that of ONNX fp32

Open lierer007 opened this issue 2 years ago • 0 comments

src/transformer_deploy/convert.py

model_path = onnx_model_path if is_fp16 else optim_model_paths[0]

Maybe there is something wrong with this line of code: when is_fp16 is true it selects onnx_model_path rather than optim_model_paths[0]. That could explain why the experimental results show ONNX fp16 performing even worse than ONNX fp32.
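To make the concern concrete, here is a minimal sketch of how that conditional expression picks a path. The function name select_model_path and the file names are hypothetical, chosen only for illustration; only the expression itself is taken from the quoted line, and the sketch says nothing about what the rest of convert.py does with the result.

```python
from typing import List


def select_model_path(
    is_fp16: bool, onnx_model_path: str, optim_model_paths: List[str]
) -> str:
    """Hypothetical wrapper around the expression quoted from convert.py."""
    # Same conditional expression as in the issue: the fp16 branch returns
    # onnx_model_path, the other branch returns the first optimized path.
    return onnx_model_path if is_fp16 else optim_model_paths[0]


# Hypothetical file names, not taken from the repository.
onnx_model_path = "model.onnx"
optim_model_paths = ["model_optim.onnx"]

fp16_choice = select_model_path(True, onnx_model_path, optim_model_paths)
fp32_choice = select_model_path(False, onnx_model_path, optim_model_paths)

print("fp16 run uses:", fp16_choice)
print("fp32 run uses:", fp32_choice)
```

If the two runs really do load different files, the fp16 and fp32 numbers would not be a like-for-like comparison, which is worth checking before concluding that fp16 itself is slower.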

Nov 03 '23 08:11 lierer007