nivren

Results 1 comments of nivren

same issue here. vllm only utilized 1 cpu core for each GPU, and all utilized cpu cores kept 100% usage.