neural-compressor New API ONNXRT example update

Type of Change

example

Description

update ONNXRT example for new API

JIRA ticket: ILITV-2468

How has this PR been tested?

extension test on onnx models

Dependency Change?

no

Nov 29 '22 09:11 yuwenzho

hi @chensuyue, PR is ready for extension test

Dec 09 '22 07:12 yuwenzho

extension test

pls check the tuning regression.
benchmark.sh api gap.

Dec 12 '22 02:12 chensuyue

@chensuyue extension test: https://inteltf-jenk.sh.intel.com/job/intel-lpot-validation-top-mr-extension/3784/artifact/report.html

performance regression is caused by switching performance dataset from dummy to real dataset.

Dec 16 '22 05:12 yuwenzho

extension test for the other examples.

Dec 16 '22 06:12 chensuyue

https://inteltf-jenk.sh.intel.com/job/intel-lpot-validation-top-mr-extension/3851/

Dec 19 '22 11:12 yuwenzho

https://inteltf-jenk.sh.intel.com/job/intel-lpot-validation-top-mr-extension/3877/ Note: object detection models need new quantization recipe support from Strategy team and may not pass extension test now.

Dec 21 '22 02:12 yuwenzho

https://inteltf-jenk.sh.intel.com/job/intel-lpot-validation-top-mr-extension/3877/ Note: object detection models need new quantization recipe support from Strategy team and may not pass extension test now.

NLP models failed due to some typos and code changes not working. Retest: https://inteltf-jenk.sh.intel.com/job/intel-lpot-validation-top-mr-extension/3883/

Dec 21 '22 10:12 yuwenzho

https://inteltf-jenk.sh.intel.com/job/intel-lpot-validation-top-mr-extension/3877/ Note: object detection models need new quantization recipe support from Strategy team and may not pass extension test now.

NLP models failed due to some typos and code changes not working. Retest: https://inteltf-jenk.sh.intel.com/job/intel-lpot-validation-top-mr-extension/3883/

Retest: https://inteltf-jenk.sh.intel.com/job/intel-lpot-validation-top-mr-extension/3890/ yolov3, yolov4 and tiny_yolov3 will not be enabled in this version because 'onnxrt.graph_optimization.level' is not supported now.

Dec 22 '22 02:12 yuwenzho

Retest: https://inteltf-jenk.sh.intel.com/job/intel-lpot-validation-top-mr-extension/3890/ yolov3, yolov4 and tiny_yolov3 will not be enabled in this version because 'onnxrt.graph_optimization.level' is not supported now.

ssd-12, ssd-12_qdq, faster_rcnn, faster_rcnn_qdq, mask_rcnn, mask_rcnn_qdq will be re-enabled in 2.1 with supported 'onnxrt.graph_optimization.level' and quantization recipe. Please ignore them in extension test.
hf model failed with error: 'setting an array element with a sequence. The requested array has an inhomogeneous shape after 1 dimensions. The detected shape was (4,) + inhomogeneous part.', which is caused from numpy version update. issue

Update:

remove ssd, faster_rcnn and mask_rcnn model
update model config json
add numpy==1.23.5 into requirements.txt in huggingface model

Retest: https://inteltf-jenk.sh.intel.com/job/intel-lpot-validation-top-mr-extension/3909/

Dec 23 '22 05:12 yuwenzho

passed: bert_squad_model_zoo_dynamic, mobilebert_squad_mlperf_dynamic, mobilebert_squad_mlperf_qdq, duc, BiDAF_dynamic and huggingface question answering models

failed: gpt2_lm_head_wikitext_model_zoo_dynamic and huggingface test classification models, retest: https://inteltf-jenk.sh.intel.com/job/intel-lpot-validation-top-mr-extension/3913/

Dec 23 '22 10:12 yuwenzho

passed: bert_squad_model_zoo_dynamic, mobilebert_squad_mlperf_dynamic, mobilebert_squad_mlperf_qdq, duc, BiDAF_dynamic and huggingface question answering models: https://inteltf-jenk.sh.intel.com/job/intel-lpot-validation-top-mr-extension/3908/artifact/report.html gpt2_lm_head_wikitext_model_zoo_dynamic and huggingface test classification models: https://inteltf-jenk.sh.intel.com/job/intel-lpot-validation-top-mr-extension/3919/artifact/report.html

Dec 26 '22 01:12 yuwenzho

neural-compressor neural-compressor copied to clipboard

New API ONNXRT example update

Type of Change

Description

How has this PR been tested?

Dependency Change?

neural-compressor
neural-compressor copied to clipboard