minigpt4.cpp icon indicating copy to clipboard operation
minigpt4.cpp copied to clipboard

[error] translate text from english to chinese

Open subineru opened this issue 1 year ago • 2 comments

utf8_error

subineru avatar Jul 23 '23 13:07 subineru

Hi, @Maknee. Could you recheck this?

I found a reliable way to reproduce this issue:

from huggingface_hub import hf_hub_download
import minigpt4_library

model_path = hf_hub_download(
    repo_id="maknee/minigpt4-13b-ggml",
    filename="minigpt4-13B-f16.bin",
    repo_type="dataset",
)

llm_model_path = hf_hub_download(
    repo_id="maknee/ggml-vicuna-v0-quantized",
    filename="ggml-vicuna-13B-v0-q5_k.bin",
    repo_type="dataset",
)

minigpt4_chatbot = minigpt4_library.MiniGPT4ChatBot(model_path, llm_model_path)

for output in minigpt4_chatbot.generate(
    "tell me about yourself",
    limit=1024,
    temp=0,
    top_k=0,
    top_p=0,
    repeat_penalty=1,
):
    print(output)

Using an emoticon as prompt makes it fail even earlier (it fails before printing anything on the terminal):

for output in minigpt4_chatbot.generate(
     "😀",
    limit=1024,
    temp=0,
    top_k=0,
    top_p=0,
    repeat_penalty=1,
):
    print(output)

I think minigpt4.cpp would require a similar solution like the one applied here:

  • https://github.com/abetlen/llama-cpp-python/pull/118

felladrin avatar Aug 02 '23 16:08 felladrin

Thanks @felladrin. Interesting, I'll have to look into this with that PR in mind. Unfortunately, I haven't tested any other input besides english characters/words.

Maknee avatar Aug 02 '23 20:08 Maknee