OpenLLM
OpenLLM copied to clipboard
feat(client): support wait time for serverless startup time
ideally, when the bento running on bentocloud is on serverless, the client should be able to retry connection until the pod is alive