
[Feature]: No more routing to this model after health checking for problem servers

Open wac81 opened this issue 1 year ago • 4 comments

The Feature

When a health check finds a problem with a server, the router should stop routing requests to that model until the server is healthy again.

Motivation, pitch

[Feature]: No more routing to this model after health checking for problem servers

Twitter / LinkedIn details

No response

wac81 avatar Apr 21 '24 15:04 wac81

Hey @wac81, what does that mean? We already support model cooldowns: https://docs.litellm.ai/docs/routing#cooldowns

krrishdholakia avatar Apr 23 '24 20:04 krrishdholakia

@krrishdholakia What I mean is that when the health check finds a problem with a server, the router should stop sending requests to that server, i.e. treat it as temporarily offline.

wac81 avatar May 07 '24 06:05 wac81

Oh that's interesting

Maybe what we can do is expose an update_deployment() function in the router and have the health check update a deployment's status when it is failing.
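A minimal sketch of that idea, for illustration only. It does not use litellm's actual `Router`; `ToyRouter`, `Deployment`, `pick_deployment()`, `run_health_checks()`, and the `update_deployment(name, healthy)` signature are all hypothetical names chosen here, and the probe function is assumed to be supplied by the caller.

```python
import random
from dataclasses import dataclass, field
from typing import Callable, Dict


@dataclass
class Deployment:
    name: str
    api_base: str
    healthy: bool = True  # set by the health check, not by request failures


@dataclass
class ToyRouter:
    """Stand-in router for illustration; NOT litellm's Router class."""

    deployments: Dict[str, Deployment] = field(default_factory=dict)

    def add_deployment(self, deployment: Deployment) -> None:
        self.deployments[deployment.name] = deployment

    def update_deployment(self, name: str, healthy: bool) -> None:
        # Hypothetical hook: record the latest health status of one deployment.
        self.deployments[name].healthy = healthy

    def pick_deployment(self) -> Deployment:
        # Only deployments whose last health check passed are eligible.
        candidates = [d for d in self.deployments.values() if d.healthy]
        if not candidates:
            raise RuntimeError("no healthy deployments available")
        return random.choice(candidates)


def run_health_checks(router: ToyRouter, probe: Callable[[Deployment], bool]) -> None:
    # `probe` returns True when the server behind a deployment responds correctly.
    for deployment in list(router.deployments.values()):
        router.update_deployment(deployment.name, healthy=probe(deployment))
```

In this sketch a failing deployment is skipped until a later health check passes, at which point it becomes eligible again. That differs from a cooldown, which is triggered by failed requests rather than by a periodic probe.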

krrishdholakia avatar May 07 '24 14:05 krrishdholakia

Yes, I think this is very important. You could expose update_deployment() and then provide a simple example of how to implement it.

wac81 avatar May 17 '24 12:05 wac81

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs.

github-actions[bot] avatar Jan 29 '25 00:01 github-actions[bot]