worker-vllm
worker-vllm copied to clipboard
Feature request: environment variables for reasoning
Hi,
I am trying to deploy a reasoning model via Runpod Serverless (qwq from Qwen). However, the environment variables for enable-reasoning and reasoning-parser are missing. The flags were found in the official documentation of vllm.
Is it possible to bring these environment variables in a future release?
related commit https://github.com/runpod-workers/worker-vllm/commit/30dd7c1eb50355a894cd600791e72c5afa3dbefe
Wait, it's not complete. I've just spotted the strange commit, where this feature was commented out
@hase808 I use this custom CMD as temporary fix:
/bin/bash -c 'if ! grep -q "enable_reasoning=True" /src/engine.py; then sed -i "s/# enable_reasoning=/enable_reasoning=True, #/" /src/engine.py && sed -i "s/# reasoning_parser=/reasoning_parser=\"deepseek_r1\", #/" /src/engine.py; fi && python3 /src/handler.py'