helix icon indicating copy to clipboard operation
helix copied to clipboard

Ideas for different scheduling strategies

Open philwinder opened this issue 1 year ago • 1 comments

  1. Imagine the situation where you are under constant load from a single model type. Then a user comes in with another model type. It will never get scheduled.

  2. Imagine prod. We have a lot of machines. It's annoying that image models are constantly evicted for text models, because they take a while to load. It would be great if we could pin models.

3... more?

philwinder avatar Dec 05 '24 14:12 philwinder

Related to #602

philwinder avatar Dec 05 '24 15:12 philwinder