sample-app-aoai-chatGPT icon indicating copy to clipboard operation
sample-app-aoai-chatGPT copied to clipboard

Chat load management by switching models

Open mdsa3d opened this issue 9 months ago • 0 comments

Is your feature request related to this sample app, or to an Azure service, such as Azure OpenAI or Azure AI Search?

Allowing to limit using certain models for a pre-defined amount and then switch to other models for rest of the interactions.

Is your feature request related to a problem? Please describe.

I cannot switch between models during conversation.

Describe the solution you'd like

I would like to set a limit of 10 questions using GPT4, once the limit is reached then it will automatically switch to GPT3.5 for next 3 hours.

Is this feature specific to your use case or your organization, or would it apply broadly across other uses of this app?

No, everyone can use this feature, and it will allow others to manage the cost of the chatbot efficiently.

Describe alternatives you've considered

Setting the token limit at a model level.

mdsa3d avatar May 14 '24 08:05 mdsa3d