belle9217

Results 11 issues of belle9217

AttributeError: 'Model' object has no attribute 'update' when i change the parameter forward_only as True ,then run train.py ,finally, i get the mistake,i don't know why,can you help me?

爬虫很久,最后每个文件夹都是空的,不知道为什么,委屈巴巴

File "/slurmhome/qqq/.local/lib/python3.11/site-packages/transformers/models/llama/modeling_llama.py", line 333, in forward ^^^^^^^^^^^^^^^^^ File "/slurmhome/qqq/.local/lib/python3.11/site-packages/transformers/models/llama/modeling_llama.py", line 184, in apply_rotary_pos_emb query_states, key_states = apply_rotary_pos_emb(query_states, key_states, cos, sin, position_ids) cos = cos[position_ids].unsqueeze(1) # [bs, 1, seq_len, dim] ~...

when I run get_list_of_datasets.py ,i get "Number of datasets 0" ,this result means that there is no data in huggingface using this author,how can i get the data. Thanks for...

huggingface上openllama-13b的模型大小为26.4G,转换为huggingface那种模型格式之后模型大小为24.7G,这也就是大概是以fp16或者是bf16保存的模型,那这样的话,请问我在加载这个权重时候需要设置什么参数吗,还是直接加载这种半精度的权重进行训练?ps:我试了直接用半精度的权重进行训练,模型的生成结果重复的比较多,我在想是不是要把权重转换为fp32那种,还是全程都用fp16那种进行训练,因此影响了模型的精度。

你好,我用该项目训练中sentencepiece训练了一个中文词表,和falcon的英文词表无法合并,使用AutoTokenizer加载的falcon英文词表,没有sp_model属性,请问该怎么解决呢

when i run predict.py,there is a mistake:TypeError: sampled_loss() got an unexpected keyword argument 'logits'.I think it's a model paramater mistake,but I can't find where it is,can you help me?