Baichuan-7B icon indicating copy to clipboard operation
Baichuan-7B copied to clipboard

有关哪个模型推理

Open leizhu1989 opened this issue 1 year ago • 4 comments

Required prerequisites

Questions

你好,请问我用 https://huggingface.co/baichuan-inc/baichuan-7B/tree/main 上的模型推理,出现错误: Traceback (most recent call last): File "/home/ctgcdt/anaconda3/envs/baichuanGPT/lib/python3.9/site-packages/transformers/modeling_utils.py", line 446, in load_state_dict return torch.load(checkpoint_file, map_location="cpu") File "/home/ctgcdt/anaconda3/envs/baichuanGPT/lib/python3.9/site-packages/torch/serialization.py", line 797, in load with _open_zipfile_reader(opened_file) as opened_zipfile: File "/home/ctgcdt/anaconda3/envs/baichuanGPT/lib/python3.9/site-packages/torch/serialization.py", line 283, in init super().init(torch._C.PyTorchFileReader(name_or_buffer)) RuntimeError: PytorchStreamReader failed reading zip archive: failed finding central directory

During handling of the above exception, another exception occurred:

Traceback (most recent call last): File "/home/ctgcdt/anaconda3/envs/baichuanGPT/lib/python3.9/site-packages/transformers/modeling_utils.py", line 450, in load_state_dict if f.read(7) == "version": File "/home/ctgcdt/anaconda3/envs/baichuanGPT/lib/python3.9/codecs.py", line 322, in decode (result, consumed) = self._buffer_decode(data, self.errors, final) UnicodeDecodeError: 'utf-8' codec can't decode byte 0x80 in position 64: invalid start byte

During handling of the above exception, another exception occurred:

Traceback (most recent call last): File "/home/ctgcdt/zhulei/NLP/baichuan-7B/test.py", line 5, in model = AutoModelForCausalLM.from_pretrained("/media/sdc/baichuan/baichuan-7B", device_map="auto", trust_remote_code=True) File "/home/ctgcdt/anaconda3/envs/baichuanGPT/lib/python3.9/site-packages/transformers/models/auto/auto_factory.py", line 462, in from_pretrained return model_class.from_pretrained( File "/home/ctgcdt/anaconda3/envs/baichuanGPT/lib/python3.9/site-packages/transformers/modeling_utils.py", line 2542, in from_pretrained state_dict = load_state_dict(resolved_archive_file) File "/home/ctgcdt/anaconda3/envs/baichuanGPT/lib/python3.9/site-packages/transformers/modeling_utils.py", line 462, in load_state_dict raise OSError( OSError: Unable to load weights from pytorch checkpoint file for '/media/sdc/baichuan/baichuan-7B/pytorch_model.bin' at '/media/sdc/baichuan/baichuan-7B/pytorch_model.bin'. If you tried to load a PyTorch model from a TF 2.0 checkpoint, please set from_tf=True.

请问这个模型是tf训练的吗,要在哪里修改呢

Checklist

  • [X] I have provided all relevant and necessary information above.
  • [X] I have chosen a suitable title for this issue.

leizhu1989 avatar Jun 21 '23 09:06 leizhu1989

推理代码: from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("baichuan-inc/baichuan-7B", trust_remote_code=True) model = AutoModelForCausalLM.from_pretrained("baichuan-inc/baichuan-7B", device_map="auto", trust_remote_code=True) inputs = tokenizer('登鹳雀楼->王之涣\n夜雨寄北->', return_tensors='pt') inputs = inputs.to('cuda:0') pred = model.generate(**inputs, max_new_tokens=64,repetition_penalty=1.1) print(tokenizer.decode(pred.cpu()[0], skip_special_tokens=True))

leizhu1989 avatar Jun 21 '23 09:06 leizhu1989

遇到了同样问题,请问怎么解决的?

adzhua avatar Jun 22 '23 03:06 adzhua

@adzhua 还没解决,等大佬回复

leizhu1989 avatar Jun 25 '23 01:06 leizhu1989

看起来是模型文件损坏了,删了重下试试有用没?

Louis-y-nlp avatar Jun 30 '23 06:06 Louis-y-nlp