
Max context length exceeded in __rpc_call(method="generate_questions")

Open bhuvanrawal opened this issue 1 year ago • 1 comment

{'error': {'code': -32603, 'message': 'BadRequestError(\'Error code: 400 - {\\\'error\\\': {\\\'message\\\': "This model\\\'s maximum context length is 16385 tokens. However, you requested 16583 tokens (16083 in the messages, 500 in the completion). Please reduce the length of the messages or completion.", \\\'type\\\': \\\'invalid_request_error\\\', \\\'param\\\': \\\'messages\\\', \\\'code\\\': \\\'context_length_exceeded\\\'}}\')'}, 'jsonrpc': '2.0'}

bhuvanrawal avatar Jan 01 '24 04:01 bhuvanrawal

This happens when individual training data chunks are too large and consume the entire context window.

Try calling vn.get_training_data() and sorting the results by the string length of the content — the largest chunks are the ones blowing past the limit.
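That sort step can be sketched like this. It assumes vn.get_training_data() returns a pandas DataFrame with a "content" column (the shape recent Vanna versions use); the helper name is mine:

```python
import pandas as pd

# Hypothetical helper: rank training-data rows by content size so the
# largest chunks (the ones consuming the context window) surface first.
def largest_chunks(df: pd.DataFrame, top_n: int = 10) -> pd.DataFrame:
    out = df.copy()
    out["content_length"] = out["content"].astype(str).str.len()
    return out.sort_values("content_length", ascending=False).head(top_n)
```

Pass it the frame from vn.get_training_data() and inspect the top rows; their "id" values are the candidates to remove or split.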

You may need an alternative strategy for breaking the training data up into smaller chunks.
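One simple strategy is to split an oversized DDL or documentation string on line boundaries before re-adding the pieces with vn.train(). The function below is a hypothetical sketch, not part of Vanna:

```python
# Split text into chunks of at most max_chars characters, breaking on
# newlines so individual statements stay intact (a single line longer
# than max_chars still becomes its own oversized chunk).
def split_into_chunks(text: str, max_chars: int = 2000) -> list[str]:
    chunks: list[str] = []
    current = ""
    for line in text.splitlines(keepends=True):
        if current and len(current) + len(line) > max_chars:
            chunks.append(current)
            current = ""
        current += line
    if current:
        chunks.append(current)
    return chunks
```

Each resulting chunk can then be re-trained individually, keeping every retrieved piece comfortably inside the model's context window.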

If you'd prefer not to do that, an alternative is to use your own OpenAI API key and send requests to the new GPT-4 Turbo with its 128k-token context window, which should be able to accept these larger chunks.
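The setup for that looks roughly like this — a sketch based on Vanna's documented OpenAI + ChromaDB pattern; verify the import paths and model name against your installed version:

```python
from vanna.openai.openai_chat import OpenAI_Chat
from vanna.chromadb.chromadb_vector import ChromaDB_VectorStore

# Combine a local vector store with your own OpenAI-backed chat model.
class MyVanna(ChromaDB_VectorStore, OpenAI_Chat):
    def __init__(self, config=None):
        ChromaDB_VectorStore.__init__(self, config=config)
        OpenAI_Chat.__init__(self, config=config)

# "gpt-4-turbo" exposes a 128k-token context window, so larger
# training chunks fit in the prompt.
vn = MyVanna(config={"api_key": "sk-...", "model": "gpt-4-turbo"})
```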

https://vanna.ai/docs/snowflake-openai-standard-vannadb.html

zainhoda avatar Jan 03 '24 15:01 zainhoda