Zhengfu He
Hi, did you train the model with DDP? If so, the state dict keys may be different.
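For context, a model wrapped in PyTorch's `DistributedDataParallel` saves its parameters under keys prefixed with `module.`, so loading that checkpoint into an unwrapped model fails with missing/unexpected key errors. A minimal sketch of one way to normalize the keys, assuming the checkpoint is a plain mapping of parameter names to tensors (`strip_ddp_prefix` is a hypothetical helper name, not part of MOSS or PyTorch):

```python
from collections import OrderedDict


def strip_ddp_prefix(state_dict, prefix="module."):
    """Return a copy of state_dict with a leading DDP prefix removed from each key.

    Keys that do not start with the prefix are kept unchanged, so the helper
    is safe to call on checkpoints that were saved without DDP.
    """
    return OrderedDict(
        (key[len(prefix):] if key.startswith(prefix) else key, value)
        for key, value in state_dict.items()
    )


# Stand-in values are used here instead of tensors so the example runs without torch.
ddp_checkpoint = {"module.fc.weight": 1, "module.fc.bias": 2}
clean = strip_ddp_prefix(ddp_checkpoint)
print(list(clean.keys()))
```

With a real checkpoint, the cleaned dictionary could then be passed to `model.load_state_dict(...)` after loading it with `torch.load(..., map_location="cpu")`.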
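Thanks for your PR. I took a look at the AutoGPTQ installation: by default it reinstalls torch and the CUDA extension. That does not seem very friendly for most users. Could you design a minimal dependency set for MOSS that can be installed with pip install on top of an existing environment?
Please see this issue: https://github.com/OpenLMLab/MOSS/issues/65
This is only a warning; it should not affect the actual results.
Try updating the main function like this:
model = MossForCausalLM.from_pretrained("fnlp/moss-moon-003-sft-int4").half().cuda()
infer = Inference(model, device_map="auto")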
We are currently working on it. The workaround may be slower.
Which version of triton is producing the error?
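One way to answer this is to query the installed package metadata. The sketch below uses only the Python standard library; `installed_version` is a hypothetical helper name, and it assumes the packages were installed under the distribution names `triton` and `torch`:

```python
from importlib import metadata


def installed_version(package):
    """Return the installed version string of a package, or None if it is not installed."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


# Print the versions most relevant to a triton-related error report.
for name in ("triton", "torch"):
    print(name, installed_version(name))
```

Pasting this output into the issue alongside the full traceback makes the problem easier to reproduce.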