Carlos Alas

3 comments by Carlos Alas

@m0n1ker I would like to try your solution. Were you able to create the PR?

Hi, this worked for me: `python -m vllm.entrypoints.api_server --host 0.0.0.0 --port 8000`. And when you run the image, publish the port as well: `docker run --gpus all -it --rm -p...`
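To confirm the server is reachable from the host after starting the container, something like the following might help. It is only a minimal sketch: `port_is_open` is a hypothetical helper (not part of vLLM), and it assumes the container's port 8000 is published on host port 8000, which the truncated `docker run` command above does not confirm.

```python
import socket


def port_is_open(host: str, port: int, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to (host, port) succeeds within `timeout` seconds."""
    try:
        # create_connection raises OSError (refused, timed out, unreachable) on failure.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


if __name__ == "__main__":
    # Assumption: host port 8000 maps to the container's port 8000.
    if port_is_open("127.0.0.1", 8000):
        print("Port 8000 is reachable from the host.")
    else:
        print("Port 8000 is not reachable; check the -p mapping and --host 0.0.0.0.")
```

If this reports the port as unreachable while the server is running, the usual suspects are a missing `-p` mapping or a server bound to `127.0.0.1` inside the container instead of `0.0.0.0`.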

Glad it helped. Sorry, I only have 1 GPU, so I cannot run that test.