
fix: subsequent requests cannot be sent until 'num_concurrent_requests' requests have all finished in non-block mode

Open llsj14 opened this issue 7 months ago • 0 comments

issues

  • https://github.com/ray-project/llmperf/issues/43
  • https://github.com/ray-project/llmperf/issues/56

Summary

  • Even in non-block mode, subsequent requests cannot be sent until all 'num_concurrent_requests' in-flight requests have finished.
  • Fixing the request launcher itself was difficult because of its dependency on Ray, so I used multiple threads instead, each with its own request launcher that holds one client and controls only one request at a time.
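
The thread-per-launcher idea in the summary can be sketched roughly as follows. This is a minimal illustration using only the Python standard library, not the code from this change: `fake_send_request` and `run_workers` are hypothetical names, the sleep stands in for a real LLM call, and neither Ray nor llmperf's own launcher and client classes are used. Each worker thread handles one request at a time and picks up the next one as soon as its own request finishes, so a slow request does not hold back the other workers.

```python
import queue
import threading
import time


def fake_send_request(request_id: int) -> float:
    """Hypothetical stand-in for one LLM call; latency depends on the id."""
    latency = 0.01 * (1 + request_id % 3)
    time.sleep(latency)
    return latency


def run_workers(num_concurrent_requests: int, total_requests: int):
    """Run total_requests requests using num_concurrent_requests threads."""
    pending: "queue.Queue[int]" = queue.Queue()
    for request_id in range(total_requests):
        pending.put(request_id)

    results = []
    results_lock = threading.Lock()

    def worker() -> None:
        # Each worker owns exactly one in-flight request at a time and
        # fetches the next one immediately after the current one completes.
        while True:
            try:
                request_id = pending.get_nowait()
            except queue.Empty:
                return
            latency = fake_send_request(request_id)
            with results_lock:
                results.append((request_id, latency))

    threads = [
        threading.Thread(target=worker) for _ in range(num_concurrent_requests)
    ]
    for thread in threads:
        thread.start()
    for thread in threads:
        thread.join()
    return results


if __name__ == "__main__":
    completed = run_workers(num_concurrent_requests=4, total_requests=12)
    print(f"completed {len(completed)} requests")
```

In this sketch the number of threads plays the role of 'num_concurrent_requests': concurrency stays capped at that value, but there is no batch boundary at which every worker has to wait for the slowest request.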

llsj14, Jul 01 '24 08:07