ctransformers
Streaming decode issue
Hello. With llama, when decoding Chinese or Japanese text, a single character might need 2 or more tokens to decode. So when streaming, a chunk that comes from decoding just one token is wrong (it holds only part of the character's bytes).
Is there a way to resolve this?
llama.cpp itself doesn't have this issue.
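To illustrate what I mean, here is a minimal sketch of the problem and one possible workaround. It assumes the raw bytes of each token are available while streaming; I don't know whether ctransformers exposes them this way, so `stream_text` and the `pieces` list are hypothetical and use only the Python standard library. The idea is to hold back incomplete UTF-8 sequences until the remaining bytes arrive:

```python
import codecs
from typing import Iterable, Iterator


def stream_text(token_bytes: Iterable[bytes]) -> Iterator[str]:
    """Yield text chunks, buffering bytes that do not yet form a full character.

    token_bytes is a hypothetical stream of per-token byte strings.
    """
    decoder = codecs.getincrementaldecoder("utf-8")(errors="replace")
    for piece in token_bytes:
        text = decoder.decode(piece)  # returns "" while a character is incomplete
        if text:
            yield text
    tail = decoder.decode(b"", final=True)  # flush anything left at the end
    if tail:
        yield tail


# "你好" is 6 bytes in UTF-8; here it is split across 3 made-up token pieces.
pieces = [b"\xe4\xbd", b"\xa0\xe5", b"\xa5\xbd"]

# Decoding each piece on its own produces replacement characters.
naive = [p.decode("utf-8", errors="replace") for p in pieces]

# Incremental decoding only emits complete characters.
chunks = list(stream_text(pieces))
print(naive)
print(chunks)
```

With this approach a chunk may be delayed until the token that completes the character arrives, but no broken characters are emitted.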