beyondguo comments

Results 59 comments of


                                            beyondguo

多卡运行后报错

你打印一下 `print(model.hf_device_map)` 看看具体layer是怎么分配的（先确保你加载模型的时候使用的是 device_map="auto"）。

这个微调代码不能直接用来baichuan-13B模型的微调？在13B一直报错

每个模型结构可能会有不一样，代码应该需要小的改动。近期我抽空调试一下

请教大佬一个问题，关于输入长度

训练比推理消耗的显存肯定更大很多，只能试试降低batch，或者开启量化之类的操作了。

chatglm2报错：ValueError: weight is on the meta device, we need a `value` to put in on 0

这个好像是个accelerate包的一些问题，你可以先自行查阅一下，比如https://discuss.huggingface.co/t/meta-device-error-while-instantiating-model/33402 另外你使用的各种包的版本跟我的一致吗？你的device是什么？

怎么训练多轮对话呀

暂时还没支持这个

lora tuning 出的权重，再加一个合并的功能？

`PefModel`类中有一个`merge_and_unload`函数，可以先试试。

RuntimeError: Expected is_sm80 to be true, but got false.

pytorch版本？

RuntimeError: Expected is_sm80 to be true, but got false.

奇怪，但我使用的torch2.0

报错“KeyError: 'transformer.embedding'”

https://github.com/beyondguo/LLM-Tuning/issues/8

tuning之后似乎glm6b模型的基础对话能力也消失了。

跟你微调的数据关系很大，你用的啥数据微调的？