Comparison with deepspeed inference?

Open allanj opened this issue 2 years ago • 1 comments

as title mentioned

Aug 02 '23 16:08 allanj

Thank you for your attention. As deepspeed primarily focuses on the static inference performance of models, while we are more concerned with the throughput of the entire inference service, we haven't compared it directly. We would greatly appreciate it if you could provide some comparative results.

Aug 03 '23 03:08 shihaobai