codingma comments

Results 76 comments of


                                            codingma

Merge multiple LoRA by weights

目前项目暂不支持，你可以在项目外部使用官方文档方法先 merge好weight，然后再放到本项目里使用。

多机多卡跑zero3怎么分配每一台机器上是一个完整的模型？

zero3 就是模型参数分布式分布，以卡的维度来分配，而不是机器的维度，你这个属于特殊需求了，应该不支持。

vLLM是否已经支持lora？

目前不支持，你可以合并导出模型后再使用完整版模型即可。

For pretraining with LORA what is the expected output? A Lora Adapator or the complete pretrained model with adaptor merged in it?

The direct product is LoRA adaptor. Then You can merge it into base model, like this https://github.com/hiyouga/LLaMA-Factory/tree/main/examples/merge_lora to get a complete model. bless.

阿里云v100微调chatglm3-6b,显存并没使用多少,出现OutOfMemoryError: CUDA out of memory

未说明使用了什么参数设置来训练，无法判断问题。

阿里云v100微调chatglm3-6b,显存并没使用多少,出现OutOfMemoryError: CUDA out of memory

额，我还是不知道你是在做什么训练。至少比如你是参考哪个脚本，作的是预训练，还是SFT，还是什么。