fast-llama
fast-llama copied to clipboard
Llama 2 7b chat Q8 guff causes error unknown tokenid
Llama 2 7b chat Q8 guff causes error unknown tokenid Commad ./main -c ./llama-2-7b-chat.Q8_o.gguf -j 40 -n 200-i "Advice "
Error
ERROR:[src/model_loaders/gguf_loader.cpp:320][load_gguf()]Unknown key:tokenizer.ggml.unknown_token_id Failed to load model
+1