Yi Luo

Results 1 comments of Yi Luo

Which parameter in vLLM server corresponds to TGI's --max-concurrent-requests? When can the VLLM_ENGINE_MAX_CONCURRENT_REQUESTS parameter be used?