DeepSeek-Coder-V2 icon indicating copy to clipboard operation
DeepSeek-Coder-V2 copied to clipboard

ollama Model is configured wrong: Double BOS

Open Snowman-25 opened this issue 1 year ago • 2 comments

Whenever I run deepseek-coder-v2:latest through ollama, the following Error pops up in the log for each prompt:

llm_tokenizer_bpe::check_double_bos_eos: Added a BOS token to the prompt as specified by the model but the prompt also starts with a BOS token. So now the final prompt starts with 2 BOS tokens. Are you sure this is what you want?

I believe that this is due to tokenizer.ggml.add_bos_token being set to true but the template also already having a <|begin▁of▁sentence|> token.

I'm not sure how this affects hallucinations of the model

Snowman-25 avatar Sep 19 '24 10:09 Snowman-25

same issue for me, any update?

Arvin2focus avatar Jan 21 '25 08:01 Arvin2focus

same issue for me

jianjunchang avatar Oct 13 '25 02:10 jianjunchang