langtest icon indicating copy to clipboard operation
langtest copied to clipboard

Rate Limit feature for Azure/OpenAI results generation

Open Jeff-ZYX opened this issue 11 months ago • 1 comments

Is your feature request related to a problem? Please describe. Current architecture doesn't seem to support API instances with inbuilt rate limits, causing errors in generation

Describe the solution you'd like A sleep function that can be configured to change rates of generation of GPT completions.

Jeff-ZYX avatar Feb 27 '24 14:02 Jeff-ZYX

Hi @Jeff-ZYX

We acknowledge and appreciate the information you brought to our attention. Currently, we do not have a built-in rate restriction capability in our architecture for API instances. We understand that controlling the rate of generation is crucial to prevent errors and ensure error-free operation for GPT completions. We thank you for recommending the addition of a programmable sleep function to efficiently regulate the generating rate. We value your input and will prioritize this feature request for future updates, even though we are unable to offer a quick fix.

chakravarthik27 avatar Feb 28 '24 18:02 chakravarthik27