
When --gpu-memory-utilization is set to 0.9, the actual fraction of GPU memory used is more than 0.9

Open meichangsu1 opened this issue 2 years ago • 2 comments

[Image: WeChat screenshot, 微信图片_20240104164759]
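One way to put a number on the observed fraction is to divide used memory by total memory per GPU. The following is a minimal sketch, not part of the original report: the helper names `parse_memory_fractions` and `query_memory_fractions` are hypothetical, and it assumes `nvidia-smi` is on the PATH. Note that `nvidia-smi` reports device-wide usage, so other processes on the same GPU are included in the figure.

```python
import subprocess


def parse_memory_fractions(csv_text):
    """Turn 'used, total' rows (in MiB) into per-GPU used/total fractions."""
    fractions = []
    for line in csv_text.strip().splitlines():
        used, total = (float(field) for field in line.split(","))
        fractions.append(used / total)
    return fractions


def query_memory_fractions():
    """Ask nvidia-smi for used and total memory on every visible GPU."""
    result = subprocess.run(
        [
            "nvidia-smi",
            "--query-gpu=memory.used,memory.total",
            "--format=csv,noheader,nounits",
        ],
        capture_output=True,
        text=True,
        check=True,
    )
    return parse_memory_fractions(result.stdout)


# Hypothetical usage on a machine with NVIDIA GPUs:
#   for index, fraction in enumerate(query_memory_fractions()):
#       print(f"GPU {index}: {fraction:.3f} of total memory in use")
```

Comparing these fractions with the value passed to --gpu-memory-utilization shows how far the observed usage is from the configured limit.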

meichangsu1 avatar Jan 04 '24 08:01 meichangsu1

I don't know the vLLM internals well, but if you don't set worker-use-ray or engine-use-ray, some parts are not sent to ray.remote, and so gpu_memory_utilization is not used in these cases.

Maybe the gpu-memory-utilization documentation could be more explicit about that, but I'm not sure it's actually an issue.

FlorianJoncour avatar Jan 06 '24 13:01 FlorianJoncour

+1, same question

sdw12138 avatar Jan 08 '24 09:01 sdw12138