data-on-eks
data-on-eks copied to clipboard
Benchmark Performance to measure response times for Inference
Community Note
- Please vote on this issue by adding a 👍 reaction to the original issue to help the community and maintainers prioritize this request
- Please do not leave "+1" or other comments that do not add relevant new information or questions, they generate extra noise for issue followers and do not help prioritize the request
- If you are interested in working on this issue or have submitted a pull request, please leave a comment
What is the outcome that you are trying to reach?
Use a tool to load test and benchmark Inference latency, throughput, response times to scale Pod and create new nodes at load.
Describe the solution you would like
Use a bench marking tool like fmbt for this purpose.