Add new features for layer-wise quantization

Open n1ck-guo opened this issue 1 year ago • 1 comment

Type of Change

feature

Description

Add new features for layer-wise quantization, including get_weight, get_bias, update, and save/load. These make a layer-wise quantized model easier to use, much like a normal model.
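To illustrate the kind of interface this description has in mind, here is a minimal, self-contained sketch. It is not the code in this PR and does not use the neural_compressor API: the class name `LayerWiseStore`, its method signatures, and the JSON-on-disk layout are all assumptions made for illustration. The idea is that each layer's parameters live in their own file and are read only when requested, while `get_weight`, `get_bias`, `update`, and `save`/`load` hide that detail from the caller.

```python
import json
import os
import tempfile


class LayerWiseStore:
    """Hypothetical sketch: one file per layer, loaded only on demand."""

    def __init__(self, root):
        self.root = root
        os.makedirs(root, exist_ok=True)

    def _path(self, layer_name):
        # Assumes layer names are plain identifiers without path separators.
        return os.path.join(self.root, f"{layer_name}.json")

    def _read(self, layer_name):
        with open(self._path(layer_name)) as f:
            return json.load(f)

    def update(self, layer_name, weight, bias=None):
        """Write (or overwrite) the parameters of a single layer."""
        with open(self._path(layer_name), "w") as f:
            json.dump({"weight": weight, "bias": bias}, f)

    def get_weight(self, layer_name):
        return self._read(layer_name)["weight"]

    def get_bias(self, layer_name):
        return self._read(layer_name)["bias"]

    def save(self, path):
        """Collect every per-layer file into one checkpoint file."""
        names = sorted(n[:-5] for n in os.listdir(self.root) if n.endswith(".json"))
        with open(path, "w") as f:
            json.dump({name: self._read(name) for name in names}, f)

    @classmethod
    def load(cls, path, root):
        """Rebuild a per-layer store under `root` from a checkpoint file."""
        store = cls(root)
        with open(path) as f:
            state = json.load(f)
        for name, params in state.items():
            store.update(name, params["weight"], params["bias"])
        return store


def demo():
    with tempfile.TemporaryDirectory() as tmp:
        store = LayerWiseStore(os.path.join(tmp, "layers"))
        store.update("fc1", weight=[[1.0, 2.0], [3.0, 4.0]], bias=[0.5, -0.5])
        checkpoint = os.path.join(tmp, "checkpoint.json")
        store.save(checkpoint)
        restored = LayerWiseStore.load(checkpoint, os.path.join(tmp, "restored"))
        return restored.get_weight("fc1"), restored.get_bias("fc1")


if __name__ == "__main__":
    print(demo())
```

In a real layer-wise setup the values would be tensors rather than nested lists, and the storage format would follow whatever the library already uses; the sketch only shows how the four operations fit together.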

Expected Behavior & Potential Risk

None

How has this PR been tested?

how to reproduce the test (including hardware information)

Dependency Change?

None

n1ck-guo, Jul 03 '24 07:07

⛈️ Required checks status: Has failure 🔴

Warning: If you do not have access to re-run the Probot, please contact XuehaoSun for help. If you push a new commit, all workflows will be re-triggered.

Groups summary

🟢 Code Scan Tests workflow
| Check ID | Status | Error details |
| --- | --- | --- |
| Code-Scan | success | |
| Code-Scan (Bandit Code Scan Bandit) | success | |
| Code-Scan (DocStyle Code Scan DocStyle) | success | |
| Code-Scan (Pylint Code Scan Pylint) | success | |

These checks are required after the changes to neural_compressor/torch/algorithms/layer_wise/utils.py.

🔴 Model Tests 3x workflow
| Check ID | Status | Error details |
| --- | --- | --- |
| Model-Test-3x | failure | |
| Model-Test-3x (Generate Report GenerateReport) | no_status | |
| Model-Test-3x (Run PyTorch Model opt_125m_woq_gptq_int4) | success | |
| Model-Test-3x (Run PyTorch Model opt_125m_woq_gptq_int4_dq_bnb) | failure | |
| Model-Test-3x (Run PyTorch Model opt_125m_woq_gptq_int4_dq_ggml) | failure | |

These checks are required after the changes to neural_compressor/torch/algorithms/layer_wise/utils.py.

🔴 Unit Tests 3x-PyTorch workflow
| Check ID | Status | Error details |
| --- | --- | --- |
| UT-3x-Torch | failure | |
| UT-3x-Torch (Coverage Compare CollectDatafiles) | failure | download |
| UT-3x-Torch (Unit Test 3x Torch Unit Test 3x Torch) | success | |
| UT-3x-Torch (Unit Test 3x Torch baseline Unit Test 3x Torch baseline) | success | |

These checks are required after the changes to neural_compressor/torch/algorithms/layer_wise/utils.py.


Thank you for your contribution! 💜

Note: This comment is automatically generated and will be updated every 180 seconds within the next 6 hours. If you have any other questions, contact chensuyue or XuehaoSun for help.

github-actions[bot], Jul 03 '24 07:07

Marked as draft; this work will migrate to https://github.com/intel/neural-compressor/pull/1883

Kaihui-intel, Jul 05 '24 08:07