Elisabeth Shevtsova

Results 1 comments of Elisabeth Shevtsova

> It's not super well documented but you need to just pass in "-max-lora-rank 64" or whatever when serving since default is 16. > > python -m vllm.entrypoints.openai.api_server --max-lora-rank 64...