langchaingo
How do you rate-limit calls to LLMs configured through langchaingo?
We're currently using langchaingo to connect to a GPT-4o model deployed on Azure. Is there any documentation, or are there any examples I can refer to, on how to set token rate limits?
Thanks!
I had the same question, but for Groq, and couldn't find anything in the docs.