codingma comments

Results 88 comments of


                                            codingma

Merge multiple LoRA by weights

目前项目暂不支持，你可以在项目外部使用官方文档方法先 merge好weight，然后再放到本项目里使用。

多机多卡跑zero3怎么分配每一台机器上是一个完整的模型？

zero3 就是模型参数分布式分布，以卡的维度来分配，而不是机器的维度，你这个属于特殊需求了，应该不支持。

vLLM是否已经支持lora？

目前不支持，你可以合并导出模型后再使用完整版模型即可。

For pretraining with LORA what is the expected output? A Lora Adapator or the complete pretrained model with adaptor merged in it?

The direct product is LoRA adaptor. Then You can merge it into base model, like this https://github.com/hiyouga/LLaMA-Factory/tree/main/examples/merge_lora to get a complete model. bless.

阿里云v100微调chatglm3-6b,显存并没使用多少,出现OutOfMemoryError: CUDA out of memory

未说明使用了什么参数设置来训练，无法判断问题。

阿里云v100微调chatglm3-6b,显存并没使用多少,出现OutOfMemoryError: CUDA out of memory

额，我还是不知道你是在做什么训练。至少比如你是参考哪个脚本，作的是预训练，还是SFT，还是什么。

does `save_strategy` conflicts with `save_total_limit`?

Try to change the value of "save_strategy" to "steps", and set "save_steps" to a very large value ? It may help you .

Slow batched evals

Try to set per_device_eval_batch_size with 4 or 2, and see the speed difference.

我想测评并打分，我微调后的某一领域的模型。那么我的自定义领域测评数据集格式如何构建，数据集大小为多少合适？

测评的数据集跟微调时候的数据集格式一样啊大小没有固定标准，业务自身觉得OK 就可以了

请问，有计划支持九天大模型吗？

不是主流模型，建议有需求的同学，可以阅读一下源码，自主完成 template.py 和 constant.py 的配置