Guanghui Qin
I have the same issue with vLLM 0.6.4.post1. I was running on 4 A100 GPUs, and generation slowed down until it hung completely. Not sure if the hang was related to...