intel-extension-for-transformers
add FP8Config
Type of Change
feature
Description
Adds FP8Config to perform FP8 quantization using Habana.
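For reviewers unfamiliar with FP8, the following is a minimal sketch of what scaled FP8 quantization does numerically. It is not the code in this PR and does not use FP8Config or any Habana API. It assumes the OCP E4M3 format (3 mantissa bits, maximum finite value 448) and a single per-tensor scale; the format and scaling scheme actually used on Habana hardware may differ. All function names here are hypothetical.

```python
import math

# Assumption: OCP E4M3 format. The hardware's actual FP8 variant may differ.
E4M3_MAX = 448.0      # largest finite value
E4M3_MIN_EXP = -6     # exponent of the smallest normal value, 2**-6
MANTISSA_BITS = 3


def round_to_e4m3(x: float) -> float:
    """Round x to the nearest E4M3-representable value, saturating at +/-448."""
    if x == 0.0:
        return 0.0
    sign = -1.0 if x < 0 else 1.0
    a = min(abs(x), E4M3_MAX)
    # frexp gives a = m * 2**ex with 0.5 <= m < 1, so the binade exponent is ex - 1.
    _, ex = math.frexp(a)
    e = max(ex - 1, E4M3_MIN_EXP)          # below 2**-6 the spacing stays fixed (subnormals)
    step = 2.0 ** (e - MANTISSA_BITS)      # spacing between representable values
    return sign * min(round(a / step) * step, E4M3_MAX)


def fp8_quantize(values):
    """Hypothetical per-tensor scaled quantization: returns (quantized values, scale)."""
    amax = max(abs(v) for v in values)
    scale = E4M3_MAX / amax if amax > 0 else 1.0
    return [round_to_e4m3(v * scale) for v in values], scale


def fp8_dequantize(qvalues, scale):
    """Map quantized values back to the original range."""
    return [q / scale for q in qvalues]
```

Calling `fp8_quantize` on a list of floats and passing the result to `fp8_dequantize` shows the rounding error introduced by the 3-bit mantissa; scaling by the tensor's absolute maximum is one common way to use the full FP8 range.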
Expected Behavior & Potential Risk
the expected behavior triggered by this PR
How has this PR been tested?
how to reproduce the test (including hardware information)
Dependency Change?
any library dependency introduced or removed
⛈️ Required checks status: Has failure 🔴
Warning: If you do not have access to re-run the CI-Summary bot, please contact VincyZhang for help. If you push a new commit, all of the workflows will be re-triggered.
Groups summary
🔴 Format Scan Tests workflow
Check ID | Status | Error details | |
---|---|---|---|
format-scan (pylint) | failure | download | ❌ |
format-scan (bandit) | success | | ✅ |
format-scan (cloc) | success | | ✅ |
format-scan (cpplint) | success | | ✅ |

These checks are required after the changes to intel_extension_for_transformers/transformers/__init__.py, intel_extension_for_transformers/transformers/modeling/modeling_auto.py, intel_extension_for_transformers/transformers/utils/__init__.py, and intel_extension_for_transformers/transformers/utils/config.py.
🔴 Optimize Unit Test workflow
Check ID | Status | Error details | |
---|---|---|---|
optimize-unit-test-baseline | success | | ✅ |
optimize-unit-test-PR-test | failure | download | ❌ |
Genreate-OptimizeUT-Report | skipped | | ❓ |

These checks are required after the changes to intel_extension_for_transformers/transformers/__init__.py, intel_extension_for_transformers/transformers/modeling/modeling_auto.py, intel_extension_for_transformers/transformers/utils/__init__.py, and intel_extension_for_transformers/transformers/utils/config.py.
🔴 NeuralChat Unit Test
Check ID | Status | Error details | |
---|---|---|---|
neuralchat-unit-test-baseline | success | | ✅ |
neuralchat-unit-test-PR-test | failure | download | ❌ |
Generate-NeuralChat-Report | skipped | | ❓ |

These checks are required after the changes to intel_extension_for_transformers/transformers/__init__.py, intel_extension_for_transformers/transformers/modeling/modeling_auto.py, intel_extension_for_transformers/transformers/utils/__init__.py, and intel_extension_for_transformers/transformers/utils/config.py.
🔴 Engine Unit Test workflow
Check ID | Status | Error details | |
---|---|---|---|
engine-unit-test-baseline | cancelled | | 🚫 |
engine-unit-test-PR-test | failure | download | ❌ |
Genreate-Engine-Report | skipped | | ❓ |

These checks are required after the changes to intel_extension_for_transformers/transformers/__init__.py, intel_extension_for_transformers/transformers/modeling/modeling_auto.py, intel_extension_for_transformers/transformers/utils/__init__.py, and intel_extension_for_transformers/transformers/utils/config.py.
🟢 Chat Bot Test workflow
Check ID | Status | Error details | |
---|---|---|---|
call-inference-llama-2-7b-chat-hf / inference test | success | | ✅ |
call-inference-mpt-7b-chat / inference test | success | | ✅ |

These checks are required after the changes to intel_extension_for_transformers/transformers/__init__.py, intel_extension_for_transformers/transformers/modeling/modeling_auto.py, intel_extension_for_transformers/transformers/utils/__init__.py, and intel_extension_for_transformers/transformers/utils/config.py.
Thank you for your contribution! 💜
Note: This comment is automatically generated and will be updated every 180 seconds within the next 6 hours. If you have any other questions, contact VincyZhang or XuehaoSun for help.
@ftian1 @kevinintel @xin3he @PenghuiCheng please review this PR