binmakeswell
binmakeswell
Hi @bobo0810 We have replied it at #2937 Thanks.
This issue was closed due to inactivity. Thanks.
Hi @lupingllp and @grafail Could you please provide more details about your issue? It's hard for us to follow/reproduce/fix it now. Thanks.
Hi @lupingllp and @grafail , just as the bug hint, you need to import the CodeGenAttention class rather than only replace the model. We are also working at 'lazy init',...
We have updated a lot. Please check the latest code. This issue was closed due to inactivity. Thanks.
Hi @pilipala818 Could you please provide more details about your issue? It's hard for us to follow/reproduce/fix it now. Thanks.
We have updated a lot. Please check the latest code. This issue was closed due to inactivity. Thanks.
> titan pipeline启动太慢了,单机8卡 3090卡,pp=4, tp=2,启动训练需要等待10多分钟,30B的模型A100 80G 启动训练要卡30分钟以上, Megatron就启动很快。。。 Hi @joan126 Can you open a new issue and provide details? So we can reproduce your question. Thanks. https://github.com/hpcaitech/ColossalAI/issues/new/choose
> I got this error too when useing pytorch 1.13.0, colossalai 0.2.8 with transformers 4.28.1 and 4.24.0 How about PyTorch 1.12.1?
Glad to hear it was resolved. Thanks.