Guanghui Qin

Results 1 comments of Guanghui Qin

I have the same issue with vllm 0.6.4.post1. I worked with 4 A100 GPUs, and the generation slows down until completely hangs. Not sure if the hanging was related to...