Karan Pathak

Results 1 issues of Karan Pathak

Hello everybody, The server hangs when the GPU KV cache usage reaches 10%. ## Issue in Detail I attempted to serve the Llama2 7B Hugging Face model via vLLM on...