Open-Assistant
Load test different models in the inference-server
We want to load test different models within the inference server to understand how performance scales with model size, such as:
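As a rough starting point, here is a minimal load-test sketch in Python. The endpoint URL (`http://localhost:8000/generate`), the JSON payload shape, and the concurrency numbers are placeholder assumptions, not the inference server's actual API; they would need to be swapped for the real routes and request schema before use.

```python
# Minimal async load-test sketch for an inference endpoint.
# NOTE: BASE_URL and the payload shape below are hypothetical placeholders,
# not the Open-Assistant inference server's real API.
import asyncio
import statistics
import time

import aiohttp

BASE_URL = "http://localhost:8000/generate"  # hypothetical endpoint
CONCURRENCY = 8        # simultaneous in-flight requests
TOTAL_REQUESTS = 100   # requests per run


async def one_request(session: aiohttp.ClientSession) -> float:
    """Send one prompt and return its end-to-end latency in seconds."""
    payload = {"prompt": "Explain the moon landing to a six year old."}
    start = time.perf_counter()
    async with session.post(BASE_URL, json=payload) as resp:
        await resp.read()  # drain the body so timing covers the full response
    return time.perf_counter() - start


async def run() -> None:
    sem = asyncio.Semaphore(CONCURRENCY)

    async def bounded(session: aiohttp.ClientSession) -> float:
        async with sem:
            return await one_request(session)

    async with aiohttp.ClientSession() as session:
        t0 = time.perf_counter()
        latencies = await asyncio.gather(
            *(bounded(session) for _ in range(TOTAL_REQUESTS))
        )
        wall = time.perf_counter() - t0

    latencies = sorted(latencies)
    print(f"p50 latency:  {statistics.median(latencies):.2f}s")
    print(f"p95 latency:  {latencies[int(0.95 * len(latencies)) - 1]:.2f}s")
    print(f"throughput:   {TOTAL_REQUESTS / wall:.2f} req/s")


if __name__ == "__main__":
    asyncio.run(run())
```

Running the same script against each candidate model (and sweeping `CONCURRENCY`) would give comparable p50/p95 latency and throughput numbers per model size.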