Ze Han

13 comments by Ze Han

Has the loss = 0 issue been solved? When I freeze the LLaMA model's gradients and fine-tune only the LoRA weights, training works fine. But once I set requires_grad=True on LLaMA's embedding layer and train the embeddings together with LoRA, only the first step produces a non-zero loss; after that the loss stays at 0.

Code:

```python
import torch
from transformers import LlamaForCausalLM, TrainingArguments, Trainer, DataCollatorForLanguageModeling

model = LlamaForCausalLM.from_pretrained(
    'decapoda-research/llama-7b-hf',
    device_map='auto',
    cache_dir='./cache/',
    load_in_8bit=True,
)
model.resize_token_embeddings(len(merge_tokenizer))

trainArgs = TrainingArguments(
    output_dir='../ckps',
    do_train=True,
    per_device_train_batch_size=4,
    gradient_accumulation_steps=4,
    evaluation_strategy="steps",
    save_strategy="steps",
    save_steps=1000,
    eval_steps=100,
    logging_steps=1,
    warmup_steps=100,
    num_train_epochs=2,
    learning_rate=3e-4,
    ...
```
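Since the snippet above stops before the LoRA setup, here is a minimal sketch of how the embedding layer can be kept trainable alongside the LoRA adapters with peft. The LoraConfig values (r, lora_alpha, target_modules, modules_to_save) are my own illustrative assumptions, not taken from the original script:

```python
from peft import LoraConfig, get_peft_model, prepare_model_for_int8_training

# Prepare the 8-bit base model for training (casts norms to fp32,
# enables input gradients for checkpointing, etc.).
model = prepare_model_for_int8_training(model)

lora_config = LoraConfig(
    r=8,                                   # illustrative values, not from the original script
    lora_alpha=16,
    lora_dropout=0.05,
    bias="none",
    task_type="CAUSAL_LM",
    target_modules=["q_proj", "v_proj"],
    # Keep the (resized) embedding and output head trainable and saved
    # together with the adapter weights.
    modules_to_save=["embed_tokens", "lm_head"],
)
model = get_peft_model(model, lora_config)

# Equivalent to "setting the embed layer's gradient to True" by hand:
for name, param in model.named_parameters():
    if "embed_tokens" in name:
        param.requires_grad = True

model.print_trainable_parameters()
```

Listing embed_tokens (and lm_head, since the token embeddings were resized) under modules_to_save also makes sure those weights are written out with the adapter checkpoint instead of being silently dropped.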

Thanks for the reply. Yes, this is under int8. When I turn int8 off and fine-tune with peft LoRA, I get an error saying the tensors are split between GPU and CPU. After printing the device of each model weight, I found that all the LoRA weights are on the CPU; the rest of the code is the same as what I posted above. How can I solve this?
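For anyone hitting the same device mismatch, this is roughly how the per-parameter devices can be printed; the `.to("cuda")` line at the end is only a guessed workaround (with device_map='auto' it may need adjusting), not a confirmed fix:

```python
from collections import Counter

# Count how many parameters live on each device.
print(Counter(str(p.device) for p in model.parameters()))

# List any LoRA parameters that are still on the CPU.
for name, param in model.named_parameters():
    if "lora_" in name and param.device.type == "cpu":
        print(name, param.device)

# Possible workaround (assumption): move the PEFT-wrapped model to the GPU
# after get_peft_model so the freshly initialised adapter weights follow it.
model = model.to("cuda")
```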

A new peft version has been released: https://github.com/huggingface/peft/releases/tag/v0.3.0

Hello, have you solved it? I have the same problem as you.

Has this been solved? I'm running into the same problem.

Hi, thanks for the reply. I found that RedPajama's base model is GPT-NeoX, but the tokenizer on the Hugging Face Hub is only the fast version, so I wanted to use GPT-Neo instead, whose tokenizer is based on GPT-2. When I tried GPT2Tokenizer with `red_tokenizer = GPT2Tokenizer.from_pretrained('EleutherAI/gpt-neo-1.3B', cache_dir='./cache/')`, I still get the same error: (error screenshot did not upload)
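If it helps, this is what I would try instead of forcing the GPT2Tokenizer class; letting AutoTokenizer pick the tokenizer class (and using the fast GPT-NeoX tokenizer, shown here via the EleutherAI/gpt-neox-20b checkpoint as an example) is an assumption on my part, not a verified fix:

```python
from transformers import AutoTokenizer

# Let AutoTokenizer resolve the correct tokenizer class for each checkpoint.
# GPT-Neo ships a GPT-2 style byte-level BPE tokenizer, so this should load
# without forcing the GPT2Tokenizer class.
neo_tokenizer = AutoTokenizer.from_pretrained('EleutherAI/gpt-neo-1.3B', cache_dir='./cache/')

# GPT-NeoX (the RedPajama base architecture) only publishes a fast tokenizer,
# so keep use_fast=True (the default) when loading it.
neox_tokenizer = AutoTokenizer.from_pretrained('EleutherAI/gpt-neox-20b', cache_dir='./cache/', use_fast=True)
```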