data-on-eks icon indicating copy to clipboard operation
data-on-eks copied to clipboard

Benchmark Performance to measure response times for Inference

Open ratnopam opened this issue 1 year ago • 0 comments

Community Note

  • Please vote on this issue by adding a 👍 reaction to the original issue to help the community and maintainers prioritize this request
  • Please do not leave "+1" or other comments that do not add relevant new information or questions, they generate extra noise for issue followers and do not help prioritize the request
  • If you are interested in working on this issue or have submitted a pull request, please leave a comment

What is the outcome that you are trying to reach?

Use a tool to load test and benchmark Inference latency, throughput, response times to scale Pod and create new nodes at load.

Describe the solution you would like

Use a bench marking tool like fmbt for this purpose.

Describe alternatives you have considered

Additional context

ratnopam avatar Feb 24 '24 01:02 ratnopam