LLM-Tuning
Tuning LLMs with no tears💦, sharing LLM-tools with love❤️.
```
Traceback (most recent call last):
  File "/usr/local/anaconda3/envs/test39/lib/python3.9/site-packages/sklearn/__check_build/__init__.py", line 45, in <module>
    from ._check_build import check_build  # noqa
ImportError: dlopen: cannot load any more object with static TLS
During handling of the...
```
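A commonly reported workaround for this old-glibc `dlopen` limitation is to import the library that needs static TLS before anything else in the entry script; the snippet below is a sketch of that approach, not a fix confirmed by this repo.

```python
# Workaround sketch for "ImportError: dlopen: cannot load any more object
# with static TLS": import the affected library at the very top of the
# entry script, before torch/transformers, so its TLS slots are claimed
# while slots are still available.
import sklearn  # noqa: F401
```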
Thanks for your work. I want to ask why the epoch shown in the log differs from the progress bar. I used the command to run the LoRA tuning with 8 GPUs....
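One plausible explanation (not confirmed for this specific run) is data parallelism: each optimizer step consumes a batch per GPU, so the logged epoch advances faster per step than a single-GPU run would suggest. A rough sanity check with hypothetical numbers:

```python
# With 8-GPU data parallelism, one optimizer step consumes
# per_device_batch * gpus * grad_accum examples, so the "epoch" field in
# the log and the step-based progress bar advance at different rates.
dataset_size = 10_000   # hypothetical
per_device_batch = 4    # hypothetical
gpus = 8
grad_accum = 1          # hypothetical
steps_per_epoch = dataset_size // (per_device_batch * gpus * grad_accum)
print(steps_per_epoch)  # 312 optimizer steps correspond to one full epoch
```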
Following the process in the README, I built a small self-introduction dataset and went through tokenization and LoRA training step by step until the loss dropped to around 0.0001. I then loaded the fine-tuned weights with model = PeftModel.from_pretrained(model, "/home/llm/ChatGLM2-6B/finetuning/weights").half(), but the self-introduction is still unchanged (the original stock answer). Could someone explain what the right approach is, or share their fine-tuning process?
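For anyone debugging the same symptom, here is a minimal loading-and-verification sketch, assuming the adapter was saved to the path above; the chat call and the checks are illustrative, not this repo's confirmed procedure:

```python
from transformers import AutoModel, AutoTokenizer
from peft import PeftModel

base = "THUDM/chatglm2-6b"
tokenizer = AutoTokenizer.from_pretrained(base, trust_remote_code=True)
model = AutoModel.from_pretrained(base, trust_remote_code=True).half().cuda()
# Load the LoRA adapter on top of the frozen base model.
model = PeftModel.from_pretrained(model, "/home/llm/ChatGLM2-6B/finetuning/weights")
model.eval()

# If answers look unchanged, check that adapter_model.bin in the weights
# directory is non-trivially sized, and that the target_modules used in
# training actually match the base model's module names.
response, _ = model.chat(tokenizer, "请介绍一下你自己", history=[])
print(response)
```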
I looked at GPU utilization: the GPUs spike to 100% one at a time, in turn. Has anyone else seen this?
```
(tuning) [yons@Ubuntu 17:54:44] ~/work/tuning/LLM-Tuning $ python3 tokenize_dataset_rows.py \
    --model_checkpoint /home/yons/work/glm/ChatGLM2-6B/THUDM/chatglm2-6b \
    --input_file CMeiE-train.json \
    --prompt_key q --target_key a \
    --save_name simple_math_4op \
    --max_seq_length 2000 \
    --skip_overlength False
Downloading and preparing dataset generator/default to file:///home/yons/.cache/huggingface/datasets/generator/default-35c7964d6cacead3/0.0.0...
Traceback (most...
```
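One thing worth checking here, under the assumption that tokenize_dataset_rows.py declares its flags via argparse with type=bool: on the command line the string "False" is non-empty and therefore truthy, so --skip_overlength False may not do what it looks like.

```python
# argparse gotcha sketch: type=bool applies bool() to the raw string,
# and bool("False") is True because any non-empty string is truthy.
import argparse

parser = argparse.ArgumentParser()
parser.add_argument("--skip_overlength", type=bool, default=False)
args = parser.parse_args(["--skip_overlength", "False"])
print(args.skip_overlength)  # True
```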
Can the Code Llama fine-tuning script be used with Baichuan2?
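Generally the script structure carries over, but the LoRA target modules differ between the two architectures; a sketch assuming a peft-based setup (module names taken from each model's released architecture, not from this repo's scripts):

```python
# Llama-family models expose separate Q/K/V projections, while Baichuan2
# fuses them into a single "W_pack" linear, so target_modules must change.
from peft import LoraConfig, TaskType

llama_lora = LoraConfig(
    task_type=TaskType.CAUSAL_LM,
    target_modules=["q_proj", "k_proj", "v_proj"],
)
baichuan_lora = LoraConfig(
    task_type=TaskType.CAUSAL_LM,
    target_modules=["W_pack"],
)
```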
```python
def data_collator(features: list) -> dict:
    len_ids = [len(feature["input_ids"]) for feature in features]
    longest = max(len_ids)
    input_ids = []
    labels_list = []
    for ids_l, feature in sorted(zip(len_ids, features), key=lambda x: -x[0]):
        ...
```
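A hedged completion of the truncated collator above, following the common ChatGLM LoRA recipe: sequences are padded to the longest item in the batch and prompt tokens are masked out of the labels with -100. The seq_len field and the outer tokenizer variable are assumptions from that recipe, not confirmed from this repo.

```python
import torch

def data_collator(features: list) -> dict:
    len_ids = [len(feature["input_ids"]) for feature in features]
    longest = max(len_ids)
    input_ids, labels_list = [], []
    # Sort longest-first so padding is computed against the batch maximum.
    for ids_l, feature in sorted(zip(len_ids, features), key=lambda x: -x[0]):
        ids = feature["input_ids"]
        seq_len = feature["seq_len"]  # assumed: length of the prompt part
        # Mask the prompt (and padding) with -100 so only the response
        # tokens contribute to the loss.
        labels = [-100] * (seq_len - 1) + ids[seq_len - 1:] + [-100] * (longest - ids_l)
        ids = ids + [tokenizer.pad_token_id] * (longest - ids_l)
        input_ids.append(torch.LongTensor(ids))
        labels_list.append(torch.LongTensor(labels))
    return {
        "input_ids": torch.stack(input_ids),
        "labels": torch.stack(labels_list),
    }
```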
'ChatGLMForConditionalGeneration' object has no attribute 'hf_device_map'
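One common trigger, assuming the model was loaded without a device map: accelerate only sets hf_device_map on a model when device_map is passed at load time, so any code path that reads the attribute fails otherwise. A sketch:

```python
# Loading with device_map="auto" makes accelerate populate
# model.hf_device_map; without it the attribute simply does not exist.
from transformers import AutoModel

model = AutoModel.from_pretrained(
    "THUDM/chatglm-6b",
    trust_remote_code=True,
    device_map="auto",
)
print(model.hf_device_map)
```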
Thanks for your work! Why does the PPO model here have a value head attached? https://github.com/beyondguo/LLM-Tuning/blob/ed68123815bc0add9ad2d7ddc2a48dc584db2c94/RLHF/rl_training.py#L185C1-L185C11 This head seems to be randomly initialized?
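For reference, trl's AutoModelForCausalLMWithValueHead (which rl_training.py appears to build on) attaches a small linear head that PPO uses for per-token value estimates; it does start out randomly initialized and is trained jointly during PPO updates. A minimal sketch:

```python
# The wrapper adds v_head, a Linear(hidden_size, 1) on top of the LM.
# PPO needs a value estimate V(s) per token to compute advantages, which
# a plain causal LM does not provide, hence the extra head.
from trl import AutoModelForCausalLMWithValueHead

model = AutoModelForCausalLMWithValueHead.from_pretrained("gpt2")
print(model.v_head)  # randomly initialized at first; learned during PPO
```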
ChatGLM-6B LoRA fine-tuning raises "iteration over a 0-d tensor" once it reaches the specified eval_step. The failure is shown below. The code:

```python
def train_v2(model, train_data, val_data):
    writer = SummaryWriter()
    world_size = int(os.environ.get("WORLD_SIZE", 1))
    ddp = world_size != 1
    train_args = TrainingArguments(
        output_dir=args.output_path,
        do_train=True,
        per_device_train_batch_size=4,
        ...
```
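The error message itself is easy to reproduce; a minimal sketch of the usual cause (some eval or metrics code iterating over a scalar loss tensor) and the typical fix, independent of this repo's specifics:

```python
import torch

loss = torch.tensor(1.5)      # a 0-d (scalar) tensor
try:
    for x in loss:            # raises TypeError: iteration over a 0-d tensor
        pass
except TypeError as e:
    print(e)

for x in loss.unsqueeze(0):   # wrap to 1-d before iterating
    print(x)
```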