neural-compressor
neural-compressor copied to clipboard
add mx quant
Type of Change
feature
Description
support mx quant
Expected Behavior & Potential Risk
the expected behavior that triggered by this PR
How has this PR been tested?
UT
Dependency Change?
any library dependency introduced or removed
⛈️ Required checks status: Has failure 🔴
Warning If you do not have the access to re-run the Probot, please contact XuehaoSun for help. If you push a new commit, all of the workflow will be re-triggered.
Groups summary
🟢 Code Scan Tests workflow
| Check ID | Status | Error details | |
|---|---|---|---|
| Code-Scan | success | ✅ | |
| Code-Scan (Bandit Code Scan Bandit) | success | ✅ | |
| Code-Scan (DocStyle Code Scan DocStyle) | success | ✅ | |
| Code-Scan (Pylint Code Scan Pylint) | success | ✅ |
These checks are required after the changes to neural_compressor/common/utils/constants.py, neural_compressor/torch/algorithms/mx_quant/__init__.py, neural_compressor/torch/algorithms/mx_quant/mx.py, neural_compressor/torch/algorithms/mx_quant/utils.py, neural_compressor/torch/quantization/__init__.py, neural_compressor/torch/quantization/algorithm_entry.py, neural_compressor/torch/quantization/config.py.
🟢 Model Tests 3x workflow
| Check ID | Status | Error details | |
|---|---|---|---|
| Model-Test-3x | success | ✅ | |
| Model-Test-3x (Generate Report GenerateReport) | success | ✅ | |
| Model-Test-3x (Run PyTorch Model opt_125m_woq_gptq_int4) | success | ✅ | |
| Model-Test-3x (Run PyTorch Model opt_125m_woq_gptq_int4_dq_bnb) | success | ✅ | |
| Model-Test-3x (Run PyTorch Model opt_125m_woq_gptq_int4_dq_ggml) | success | ✅ |
These checks are required after the changes to neural_compressor/common/utils/constants.py, neural_compressor/torch/algorithms/mx_quant/__init__.py, neural_compressor/torch/algorithms/mx_quant/mx.py, neural_compressor/torch/algorithms/mx_quant/utils.py, neural_compressor/torch/quantization/__init__.py, neural_compressor/torch/quantization/algorithm_entry.py, neural_compressor/torch/quantization/config.py.
🟢 Unit Tests 3x-TensorFlow workflow
| Check ID | Status | Error details | |
|---|---|---|---|
| UT-3x-TensorFlow | success | ✅ | |
| UT-3x-TensorFlow (Coverage Compare CollectDatafiles) | success | ✅ | |
| UT-3x-TensorFlow (Unit Test 3x TensorFlow Unit Test 3x TensorFlow) | success | ✅ | |
| UT-3x-TensorFlow (Unit Test 3x TensorFlow baseline Unit Test 3x TensorFlow baseline) | success | ✅ |
These checks are required after the changes to neural_compressor/common/utils/constants.py.
🔴 Unit Tests 3x-PyTorch workflow
| Check ID | Status | Error details | |
|---|---|---|---|
| UT-3x-Torch | failure | ❌ | |
| UT-3x-Torch (Coverage Compare CollectDatafiles) | failure | download | ❌ |
| UT-3x-Torch (Unit Test 3x Torch Unit Test 3x Torch) | success | ✅ | |
| UT-3x-Torch (Unit Test 3x Torch baseline Unit Test 3x Torch baseline) | success | ✅ |
These checks are required after the changes to neural_compressor/common/utils/constants.py, neural_compressor/torch/algorithms/mx_quant/__init__.py, neural_compressor/torch/algorithms/mx_quant/mx.py, neural_compressor/torch/algorithms/mx_quant/utils.py, neural_compressor/torch/quantization/__init__.py, neural_compressor/torch/quantization/algorithm_entry.py, neural_compressor/torch/quantization/config.py, test/3x/torch/quantization/test_mx_quant.py.
🔴 Unit Tests 3x-ONNXRT workflow
| Check ID | Status | Error details | |
|---|---|---|---|
| UT-3x-ONNXRT | failure | ❌ | |
| UT-3x-ONNXRT (Coverage Compare CollectDatafiles) | no_status | ❓ | |
| UT-3x-ONNXRT (Unit Test 3x ONNXRT Unit Test 3x ONNXRT) | failure | download | ❌ |
| UT-3x-ONNXRT (Unit Test 3x ONNXRT baseline Unit Test 3x ONNXRT baseline) | failure | download | ❌ |
These checks are required after the changes to neural_compressor/common/utils/constants.py.
Thank you for your contribution! 💜
Note This comment is automatically generated and will be updates every 180 seconds within the next 6 hours. If you have any other questions, contact chensuyue or XuehaoSun for help.
@mengniwang95 any update?
Does this feature target v2.6?