Harry Mellor
I don't think you need to call `ray start` to use tensor parallel anymore. Are you still experiencing this issue?
I'll close this as stale for now
I have successfully used both GPTQ and AWQ models with vLLM. Should this issue be considered solved @WoosukKwon?
Closing as this was resolved by #2330
Closing as a duplicate of #187
Closed by https://github.com/vllm-project/vllm/pull/4539
@zhuohan123 can this work be considered complete?
Closing as this should now be fixed.
x86 CPU support was added in https://github.com/vllm-project/vllm/pull/3634. Since there are other issues asking for specific architectures, I will close this one as complete because there is now a CPU only...
Closing because a single worker will now only use Ray if the user specifies `--worker-use-ray`.