
Load test different models in the inference-server

Open jackapbutler opened this issue 2 years ago • 0 comments

We want to load test different models within the inference server to understand how performance scales with model size, such as:

jackapbutler · Feb 16 '23 15:02
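
A rough sketch of what such a load test could look like, not the project's actual tooling. It fires a fixed number of requests from a pool of worker threads and reports mean latency, p95 latency, and throughput per model. Everything named here is hypothetical: `run_load_test`, `fake_model_call`, and the "small"/"large" delays are stand-ins, and `fake_model_call` only sleeps. It would need to be swapped for a real request to the inference server.

```python
import math
import statistics
import time
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Dict


def run_load_test(
    send_request: Callable[[], None], total_requests: int, concurrency: int
) -> Dict[str, float]:
    """Call send_request total_requests times from `concurrency` threads
    and summarize the observed latencies (hypothetical helper)."""
    if total_requests < 1 or concurrency < 1:
        raise ValueError("total_requests and concurrency must be >= 1")

    def timed_call(_: int) -> float:
        # Time a single request, in seconds.
        start = time.perf_counter()
        send_request()
        return time.perf_counter() - start

    wall_start = time.perf_counter()
    with ThreadPoolExecutor(max_workers=concurrency) as pool:
        latencies = sorted(pool.map(timed_call, range(total_requests)))
    wall_time = time.perf_counter() - wall_start

    # Nearest-rank 95th percentile on the sorted latencies.
    p95_index = max(0, math.ceil(0.95 * len(latencies)) - 1)
    return {
        "requests": float(len(latencies)),
        "mean_s": statistics.fmean(latencies),
        "p95_s": latencies[p95_index],
        "throughput_rps": len(latencies) / wall_time,
    }


def fake_model_call(delay_s: float) -> Callable[[], None]:
    """Assumption: stand-in for a real inference request. It only sleeps,
    with a longer delay playing the role of a larger model."""

    def call() -> None:
        time.sleep(delay_s)

    return call


if __name__ == "__main__":
    # Hypothetical model sizes and delays, for illustration only.
    for name, delay in [("small", 0.01), ("large", 0.03)]:
        stats = run_load_test(fake_model_call(delay), total_requests=20, concurrency=4)
        print(name, stats)
```

Running the same harness against each model with identical `total_requests` and `concurrency` keeps the comparison fair, so differences in the numbers can be attributed to model size rather than to the load pattern.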