Kevin Taylor

Results 3 issues of Kevin Taylor

Created integration files (following format of other providers) for users to easily use the following Cerebras models with the repo: cerebras/llama-3.3-70b cerebras/llama3.1-8b cerebras/llama-4-scout-17b-16e-instruct

# why The Cerebras integration was out of date when it came to models and also nonfunctional during testing. # what changed Updated the Cerebras provider file to work with...

Use exponential backoff to improve UX with Cerebras models. Since the hourly and daily rate limits for tokens are the same, TPM is the limiter -> max retry wait time...