[Bug] When I enable "<|im_end|>" as stop_str in the qwen2 configuration, the final output is truncated.
🐛 Bug
To Reproduce
Steps to reproduce the behavior:
1. Do not set `<|im_end|>` as a stop string. My fine-tuned qwen2 model does output `<|im_end|>`, but the problem is that it is not emitted as the separate special token (id 151645 is `<|im_end|>`); it comes out as a single token fused with the last part of my expected output (the closing `}` of the JSON). In my case the fused piece is `}<|im_end|>`.
2. Then set `<|im_end|>` as stop_str. The last part of my expected output is cut off together with `<|im_end|>`, so the final output is missing the `}` (see the sketch below).
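To make the failure mode concrete, here is a minimal diagnostic sketch. It is not code from MLC-LLM: it assumes the `transformers` package and the public `Qwen/Qwen2-7B-Instruct` tokenizer, and the `{"answer": 42}` JSON is a made-up stand-in for my model's real output.

```python
# Minimal diagnostic sketch (not from MLC-LLM): checks whether "<|im_end|>"
# is tokenized as its own special token (id 151645) or fused with the
# preceding "}" into a single piece, and shows how piece-level stop handling
# can drop the "}" while character-level trimming keeps it.
# Assumes the `transformers` package and the public Qwen2 tokenizer; the
# JSON payload below is a made-up stand-in for my model's real output.
from transformers import AutoTokenizer

STOP = "<|im_end|>"

tok = AutoTokenizer.from_pretrained("Qwen/Qwen2-7B-Instruct")
ids = tok.encode('{"answer": 42}' + STOP, add_special_tokens=False)
pieces = [tok.decode([i]) for i in ids]
print(ids)     # 151645 should appear as its own id if "}" did not fuse with it
print(pieces)

def trim_piecewise(pieces: list[str], stop: str = STOP) -> str:
    """Suspected buggy behavior: drop everything from the first piece that
    contains the stop string, losing any text fused into that piece."""
    out = []
    for p in pieces:
        if stop in p:
            break          # the whole piece is discarded, "}" and all
        out.append(p)
    return "".join(out)

def trim_charwise(pieces: list[str], stop: str = STOP) -> str:
    """Expected behavior: trim at the character position of the stop string,
    keeping any prefix (such as "}") inside the final piece."""
    text = "".join(pieces)
    idx = text.find(stop)
    return text if idx < 0 else text[:idx]

# With a fused final piece, piece-level trimming loses the closing brace:
fused = ['{"answer": 42', "}" + STOP]
print(trim_piecewise(fused))  # -> {"answer": 42
print(trim_charwise(fused))   # -> {"answer": 42}
```

If MLC-LLM's stop handling trims whole detokenized pieces rather than trimming at the character position of the stop string, that would explain why the `}` disappears together with `<|im_end|>`.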
Expected behavior
Only the `<|im_end|>` stop string should be stripped; the complete JSON output, including the final `}`, should be returned.
Environment
- Platform: CUDA
- Operating system: Ubuntu
- Device: PC + Tesla V100
- How you installed MLC-LLM (conda, source): source
- How you installed TVM-Unity (pip, source): source
- Python version (e.g. 3.10): 3.11.8
- GPU driver version (if applicable): 535.54.03
- CUDA/cuDNN version (if applicable): 12.1
- TVM Unity Hash Tag: 69190c360cd5ce1c4a35c0f49501c96993fae416
- Any other relevant information: