worker-vllm
worker-vllm copied to clipboard
[mc] add Env and modelpath util
Note: You can only load one model at a time, Hence in quick deploy only single model is assigned.
see https://github.com/runpod-workers/worker-sglang/pull/18#issuecomment-2646023036