tensorrtllm_backend
Building the latest TensorRT-LLM (main branch, v0.12) in the Triton 24.07-trtllm-python-py3 image fails with an error
System Info
Triton image: 24.07-trtllm-python-py3
Who can help?
No response
Information
- [ ] The official example scripts
- [ ] My own modified scripts
Tasks
- [ ] An officially supported task in the `examples` folder (such as GLUE/SQuAD, ...)
- [ ] My own task or dataset (give details below)
Reproduction
The image ships with TensorRT-LLM v0.11, but v0.11 does not support QWEN2-57b, so I tried to build TensorRT-LLM from the latest main branch inside the image. The build does not complete successfully.
Expected behavior
The build completes and produces a TensorRT-LLM wheel (.whl).
Actual behavior
[ 80%] Built target context_attention_src
[ 98%] Built target decoder_attention_src
[ 98%] Built target check_symbol_executor
[ 98%] Built target check_symbol
[ 98%] Built target tensorrt_llm
[100%] Built target nvinfer_plugin_tensorrt_llm
[100%] Built target executorWorker
Requirement already satisfied: pybind11-stubgen in /usr/local/lib/python3.10/dist-packages (2.5.1)
WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable.It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning.
Traceback (most recent call last):
File "/usr/lib/python3.10/runpy.py", line 196, in _run_module_as_main
return _run_code(code, main_globals, None,
File "/usr/lib/python3.10/runpy.py", line 86, in _run_code
exec(code, run_globals)
File "/usr/local/lib/python3.10/dist-packages/pybind11_stubgen/main.py", line 4, in
- Building wheel...
Traceback (most recent call last):
File "/usr/local/lib/python3.10/dist-packages/pyproject_hooks/_in_process/_in_process.py", line 373, in
main()
File "/usr/local/lib/python3.10/dist-packages/pyproject_hooks/_in_process/_in_process.py", line 357, in main
json_out["return_val"] = hook(**hook_input["kwargs"])
File "/usr/local/lib/python3.10/dist-packages/pyproject_hooks/_in_process/_in_process.py", line 271, in build_wheel
return _build_backend().build_wheel(
File "/usr/local/lib/python3.10/dist-packages/setuptools/build_meta.py", line 415, in build_wheel
return self._build_with_temp_dir(
File "/usr/local/lib/python3.10/dist-packages/setuptools/build_meta.py", line 397, in _build_with_temp_dir
self.run_setup()
File "/usr/local/lib/python3.10/dist-packages/setuptools/build_meta.py", line 497, in run_setup
super().run_setup(setup_script=setup_script)
File "/usr/local/lib/python3.10/dist-packages/setuptools/build_meta.py", line 313, in run_setup
exec(code, locals())
File " ", line 84, in
File " ", line 48, in sanity_check
ImportError: The `bindings` module does not exist. Please check the package integrity. If you are attempting to use the pip development mode (editable installation), please execute `build_wheels.py` first, and then run `pip install -e .`.
ERROR Backend subprocess exited when trying to invoke build_wheel
Traceback (most recent call last):
File "/gaoziyuan/gzy/tensorrtllm_backend/tensorrt_llm/./scripts/build_wheel.py", line 391, in
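The log above shows two separate failures: a traceback from `pybind11_stubgen` right after the C++ targets finish, and then the `ImportError` about the missing `bindings` module during the wheel build. A minimal diagnostic sketch for checking both is given here. It is not part of the TensorRT-LLM build scripts. The helper names `installed_version` and `find_bindings` are hypothetical, and the sketch assumes that the bindings artifact, if it was built, sits directly under the `tensorrt_llm` package directory with a name of `bindings` or starting with `bindings.`; the real layout may differ.

```python
from importlib import metadata
from pathlib import Path
from typing import List, Optional


def installed_version(dist_name: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


def find_bindings(package_dir: Path) -> List[Path]:
    """List entries named `bindings` or `bindings.*` directly under package_dir.

    Assumption: the compiled bindings land directly in the package directory.
    """
    if not package_dir.is_dir():
        return []
    return sorted(
        p for p in package_dir.iterdir()
        if p.name == "bindings" or p.name.startswith("bindings.")
    )


if __name__ == "__main__":
    # Hypothetical location: adjust to the checked-out TensorRT-LLM package directory.
    package_dir = Path("tensorrt_llm")
    print("pybind11-stubgen:", installed_version("pybind11-stubgen"))
    print("bindings entries:", [p.name for p in find_bindings(package_dir)])
```

If the second line prints an empty list, that would be consistent with the `sanity_check` failure in the log, and the earlier `pybind11_stubgen` traceback would be the first place to look.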
Additional notes
Could anyone help me with this? Thanks!