DefTruth

Results 214 comments of DefTruth

@yucornetto hi~ would you like to review this PR?

maybe relate to https://github.com/vllm-project/vllm/pull/5207

@youkaichao close, seems the latest vllm (up to #5410) has fixed this problem. (TP0T 45ms v0.4.2 -> 39ms v0.5, eager mode) ```bash [I][2024-06-11 16:31:36][ 1/20][ 1/20 5%] session:0 turn:0 req:0...

> In tensorrt_llm_backend, when we launch several server by MPI with world_size > 1, only the rank 0 (main process) will recieve/return requests. Other ranks will skip this step and...