lmdeploy
lmdeploy copied to clipboard
[Bug] qwen3-vl 在线服务启动报错 Error code: 400 - {'message': 'Logprobs or top_logprobs requested but not enabled logprobs_mode in engine configuration.', 'type': 'invalid_request_error', 'code': 400, 'param': None, 'object': 'error'}
Checklist
- [x] 1. I have searched related issues but cannot get the expected help.
- [x] 2. The bug has not been fixed in the latest version.
- [x] 3. Please note that if the bug-related issue you submitted lacks corresponding environment info and a minimal reproducible demo, it will be challenging for us to reproduce and resolve the issue, reducing the likelihood of receiving feedback.
Describe the bug
环境ubuntu22.04,A800 2025.11.13 使用git clone 最新代码 cd lmdeploy pip install -r requirements/build.txt pip install -e . -v
安装成功后执行
lmdeploy serve api_server /model/Qwen3-VL-8B-sft
--dtype auto
--server-port 23333
--tp 1
--model-name qwen3-vl-sft
--max-batch-size 32
--cache-max-entry-count 0.9
请求服务推理时报错“Error code: 400 - {'message': 'Logprobs or top_logprobs requested but not enabled logprobs_mode in engine configuration.', 'type': 'invalid_request_error', 'code': 400, 'param': None, 'object': 'error'}”,请问这是什么原因
Reproduction
lmdeploy serve api_server /model/Qwen3-VL-8B-sft
--dtype auto
--server-port 23333
--tp 1
--model-name qwen3-vl-sft
--max-batch-size 32
--cache-max-entry-count 0.9 \
Environment
环境版本详细信息如下:
Package Version Editable project location
------------------------- ------------- -------------------------------------
accelerate 1.11.0
addict 2.4.0
aiohappyeyeballs 2.6.1
aiohttp 3.13.2
aiosignal 1.4.0
annotated-doc 0.0.4
annotated-types 0.7.0
anyio 4.11.0
attrs 25.4.0
certifi 2025.11.12
charset-normalizer 3.4.4
click 8.2.1
cmake 4.1.2
cmake-build-extension 0.6.1
distro 1.9.0
einops 0.8.1
fastapi 0.121.1
filelock 3.20.0
fire 0.7.1
frozenlist 1.8.0
fsspec 2025.10.0
gitdb 4.0.12
GitPython 3.1.45
h11 0.16.0
hf-xet 1.2.0
httpcore 1.0.9
httpx 0.28.1
huggingface-hub 0.36.0
idna 3.11
Jinja2 3.1.6
jiter 0.12.0
jsonschema 4.25.1
jsonschema-specifications 2025.9.1
lmdeploy 0.10.2 /code/SFT/lmdeploy
markdown-it-py 4.0.0
MarkupSafe 3.0.3
mdurl 0.1.2
mmengine-lite 0.10.7
mpmath 1.3.0
msgpack 1.1.2
multidict 6.7.0
networkx 3.5
ninja 1.13.0
numpy 2.3.4
nvidia-cublas-cu12 12.8.4.1
nvidia-cuda-cupti-cu12 12.8.90
nvidia-cuda-nvrtc-cu12 12.8.93
nvidia-cuda-runtime-cu12 12.8.90
nvidia-cudnn-cu12 9.10.2.21
nvidia-cufft-cu12 11.3.3.83
nvidia-cufile-cu12 1.13.1.3
nvidia-curand-cu12 10.3.9.90
nvidia-cusolver-cu12 11.7.3.90
nvidia-cusparse-cu12 12.5.8.93
nvidia-cusparselt-cu12 0.7.1
nvidia-nccl-cu12 2.27.3
nvidia-nvjitlink-cu12 12.8.93
nvidia-nvtx-cu12 12.8.90
openai 2.7.2
openai-harmony 0.0.8
packaging 25.0
partial-json-parser 0.2.1.1.post6
peft 0.14.0
pillow 12.0.0
pip 25.2
platformdirs 4.5.0
prometheus_client 0.23.1
propcache 0.4.1
protobuf 6.33.0
psutil 7.1.3
pybind11 2.13.1
pydantic 2.12.4
pydantic_core 2.41.5
Pygments 2.19.2
PyYAML 6.0.3
pyzmq 27.1.0
ray 2.51.1
referencing 0.37.0
regex 2025.11.3
requests 2.32.5
rich 14.2.0
rpds-py 0.28.0
safetensors 0.6.2
sentencepiece 0.2.1
setuptools 80.9.0
setuptools-scm 9.2.2
shortuuid 1.0.13
smmap 5.0.2
sniffio 1.3.1
starlette 0.49.3
sympy 1.14.0
termcolor 3.2.0
tiktoken 0.12.0
timm 1.0.22
tokenizers 0.22.1
torch 2.8.0
torchvision 0.23.0
tqdm 4.67.1
transformers 4.57.1
triton 3.4.0
typing_extensions 4.15.0
typing-inspection 0.4.2
urllib3 2.5.0
uvicorn 0.38.0
wheel 0.45.1
xgrammar 0.1.27
yapf 0.43.0
Error traceback