Qwen2.5-Math
exceed the model's predefined maximum length (4096)
When I run inference with the Qwen2.5-Math-7B model, I get the following message:
This is a friendly reminder - the current text generation call will exceed the model's predefined maximum length (4096). Depending on the model, you may observe exceptions, performance degradation, or nothing at all.
What are the context length and the maximum number of output tokens for this model?
How can I increase the maximum length?
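For reference, here is a minimal sketch of how the generation budget is usually controlled with the Hugging Face transformers `generate` API. The model name, prompt, and token budget are illustrative assumptions, not a confirmed configuration for Qwen2.5-Math; the warning is emitted when the prompt length plus the generation budget exceeds the model's configured maximum (4096 here).

```python
# Minimal sketch: bounding the generation length with transformers.
# The checkpoint name and max_new_tokens value are assumptions for illustration.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "Qwen/Qwen2.5-Math-7B-Instruct"  # adjust to the checkpoint you actually use

tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
    model_name, torch_dtype=torch.bfloat16, device_map="auto"
)

messages = [{"role": "user", "content": "Find the sum of the first 100 positive integers."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

# Passing max_new_tokens explicitly caps the number of generated tokens;
# keeping prompt length + max_new_tokens within the model's maximum
# avoids the "exceed the model's predefined maximum length" reminder.
outputs = model.generate(inputs, max_new_tokens=2048)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```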
Sorry to disturb you. Were you able to reproduce the results of the Qwen2.5-Math base models reported in the paper? I only achieved ~70% accuracy on the GSM8K dataset, which is largely inconsistent with the numbers in the paper.