openchat
Output repeats until the maximum output length is reached
When I run Openchat-3.5-0106 locally, the model repeats its output until it reaches the maximum output length. In other words, generation does not stop on its own.
The model also loads with the following warning:
Special tokens have been added in the vocabulary, make sure the associated word embeddings are fine-tuned or trained.
How can I solve this problem?
Hi @zestaken, can you provide more information about your local model setup?
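For anyone gathering that information, here is a minimal sketch of one thing worth reporting: which token the tokenizer treats as end-of-sequence, and whether it matches the chat template's end-of-turn token. The helper names `describe_stop_tokens` and `inspect_model` are hypothetical, and the model id `openchat/openchat-3.5-0106` and the token string `<|end_of_turn|>` are assumptions about the setup, not details confirmed in this thread.

```python
def describe_stop_tokens(tokenizer, end_of_turn="<|end_of_turn|>"):
    """Summarize the stop-token configuration of a Hugging Face style tokenizer.

    `end_of_turn` is an assumed token string; adjust it to your chat template.
    """
    end_of_turn_id = tokenizer.convert_tokens_to_ids(end_of_turn)
    return {
        "eos_token": tokenizer.eos_token,
        "eos_token_id": tokenizer.eos_token_id,
        "end_of_turn_id": end_of_turn_id,
        # True when generation would already stop at the end-of-turn token.
        "matches": tokenizer.eos_token_id == end_of_turn_id,
    }


def inspect_model(model_id="openchat/openchat-3.5-0106"):
    """Load the tokenizer for `model_id` (assumed id) and print its stop tokens."""
    # Imported lazily so the helper above works without transformers installed.
    from transformers import AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    info = describe_stop_tokens(tokenizer)
    for key, value in info.items():
        print(f"{key}: {value}")
    return info
```

Calling `inspect_model()` and posting the printed values, along with the library version and the generation call used, would make the setup easier to diagnose. If `matches` is false, passing the end-of-turn id as `eos_token_id` to `model.generate(...)` is one thing to try, though whether it resolves this issue is not confirmed here.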
I also encountered this problem