enh(dspy): Move max_tokens explicitly into `GPT3` constructor
Makes the reason for output truncation clearer, as per #730.
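For clarity, here's a minimal sketch of the shape of the change (the real `GPT3` subclasses dsp's LM and takes more arguments; the standalone class and the `150` default here are illustrative):

```python
class GPT3:
    """Sketch only: max_tokens becomes a named constructor argument with a
    visible default, instead of being buried in **kwargs."""

    def __init__(self, model: str = "text-davinci-002", max_tokens: int = 150, **kwargs):
        # The completion budget is now explicit, so users can see what
        # bounds (and may truncate) their outputs.
        self.kwargs = {"model": model, "max_tokens": max_tokens, **kwargs}
```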
Also adds a comment to the predict file where `max_tokens` is modified before the call is made. This happens mainly during retries etc., and it's not very straightforward to change.
Basically, the user's max_tokens is used on the first attempt, then max_tokens / 2 when more fields are needed, with 75 acting as an effective minimum. Setting a large enough max_tokens could be all that's required here?
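Roughly, the fallback looks like this (a sketch of the behavior described above, not the exact predict-file code; the function name is made up):

```python
def retried_max_tokens(max_tokens: int) -> int:
    """Sketch: retries halve the completion budget, with 75 as a floor."""
    return max(75, max_tokens // 2)

# First attempt uses the user's max_tokens as-is; a retry that needs
# more output fields halves it, but never drops below 75.
assert retried_max_tokens(4000) == 2000
assert retried_max_tokens(100) == 75
```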
Thanks for taking a look, @okhat.
Yeah, the main thing for me is putting it into the init so it's clear to the user that they need to set it. What do you think of this part?
In terms of the retries, do you mean that if max_tokens doesn't leave the LLM enough tokens beyond the full input, it'll retry with max_tokens / 2 and then with 75?
@harrysalmon just following up on this. I think we can merge the change but leave the existing retry behavior. lmk if it's ready to merge.
Closing for now to avoid errors from breaking the cache.