tensorrtllm_backend
Qwen2___5-0___5B-Instruct convert_checkpoint error
System Info
- CPU: x86_64, Intel(R) Xeon(R) Platinum 8378A CPU @ 3.00GHz
- OS: Ubuntu 20.04.1 LTS
- GPU: NVIDIA A800-SXM4-80GB
- Driver Version: 550.54.15
- CUDA Version: 12.1
Who can help?
No response
Information
- [X] The official example scripts
- [ ] My own modified scripts
Tasks
- [X] An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
- [ ] My own task or dataset (give details below)
Reproduction
Start a tensorrtllm_backend_v0.13.0 container with docker run.
Inside the container:
cd /app/tensorrt_llm/examples/qwen
python3 convert_checkpoint.py --model_dir /data/qwen/Qwen2___5-0___5B-Instruct/ \
--output_dir /data/trtllm_models/tllm_checkpoint_1gpu_fp16_Qwen2___5-0___5B-Instruct \
--tp_size 1 \
--dtype float16
Expected behavior
convert_checkpoint.py completes successfully and writes the checkpoint to the output directory.
Actual behavior
root@devserver:/app/tensorrt_llm/examples/qwen# python3 convert_checkpoint.py --model_dir /data/qwen/Qwen2___5-0___5B-Instruct/ \
--output_dir /data/trtllm_models/tllm_checkpoint_1gpu_fp16_Qwen2___5-0___5B-Instruct \
--tp_size 1 \
--dtype float16
[TensorRT-LLM] TensorRT-LLM version: 0.13.0
0.13.0
197it [00:00, 484.62it/s]
Traceback (most recent call last):
File "/app/tensorrt_llm/examples/qwen/convert_checkpoint.py", line 303, in