danger-dream comments

Results 10 comments of


                                            danger-dream

More flexible script monitoring types

Yes, our suggestion is similar, but the workload of modifying the existing monitoring code part is a bit heavy, so my suggestion is to add script type monitoring

[BUG/Help] <title>关于max_source_length 和max_target_length 的一些问题

1. max_source_length、max_target_length是输入输出文本的向量长度，可以自己用tokenizer.encode计算 2. 官方建议不超过2048，可以翻下以前的issues，有说过 3. 计入 4. 会

[Feature] <title>训练1M数据需要多久

这个要看数据和ptuning参数才能判断的。我这v100 40G，max_source_length 1024、max_target_length 512，其他参数不变的清空下，3万多数据需要12小时

> @ray-008 一直没懂, max_steps 1000 到底是啥意思? max_steps好像是只计算1000条数据? 看代码这个字段可以不填, 不填写会自动根据数据集的条数来计算step, 貌似你这50w条数据需要的时间会更长吧? 共训练 max_steps 步，每步 per_device_train_batch_size * gradient_accumulation_steps 条数据共训练 max_steps * (per_device_train_batch_size * gradient_accumulation_steps) 条数据

danger-dream

More flexible script monitoring types

[BUG/Help] <title>关于max_source_length 和max_target_length 的一些问题

api问题

拉个群呗，老哥，免费测试呀！

[Feature] <title>训练1M数据需要多久

[Feature] <title>训练1M数据需要多久

[Feature] <title>训练1M数据需要多久

chaglm2 loRA finetuning

微信或者word里面不支持划词

请问可以补充下开源协议吗？