nivren
Same issue here. vLLM used only one CPU core per GPU, and each of those cores stayed at 100% usage.