ghkl98
ghkl98
 
Failed to generate chat completion, detail: [address=0.0.0.0:35103, pid=3767] Model model_format='pytorch' model_size_in_billions=13 quantizations=['4-bit', '8-bit', 'none'] model_id='' model_hub='huggingface' model_uri='/root/Chinese-llama' model_revision=None is not for chat.
我想部署自己的本地模型,写的model.json如下 { "version": 1, "context_length": 2048, "model_name": "customer-llama-2", "model_lang": [ "en", "zh" ], "model_ability": [ "chat" ], "model_specs": [ { "model_format": "pytorch", "model_size_in_billions": 13, "quantizations": [ "4-bit", "8-bit", "none" ],...